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LIGAND BINDING VARIABLE DOMAIN (V-NIN) COMPRISING A FRAMEWORK REGION WITH 
A CYCLICALLY PERMUTED CENTRAL ^-BARREL 

The present invention relates to ligand binding variable 
domains (V-mins) having structural rearrangement of the binding domain 
of innnunoglobulins and other ligand binding members of the 
immunoglobulin super family. 

An imrounoglobiain consists of a pair of light chains (each 
light chain, L, comprising approximately 220 residues) and a pair of 
heavy chains (each heavy chain, H, comprising approximately 400 
residues) . Each ch ai n contains domains of approximately 100 residues 
in the form of fi sheets. (Kabat, E.A. , Wu, Bilofsky, H,, 

Reid-Milner, M. , Perry, H., 1983, Sequences of Proteins of 
Immunological Interest, Public Health Service N.I.H. Washington DC). 
The sequence variable N- terminal domains in the light and heavy chains 
(V^ and Vjj) associate to form a substructture (Pv) which contains the 
antigen recognition site. and domain fragments isolated by 

proteolysis (Hochman, J., Inbar, D., Givol, D. Biochemistry, 1973, 12, 
1130 - 1135) or recombinant methods (US 4,642,334 from DNAX 
Corporation) have been shown to fold and combine to form an Fv 
fragment which retains antigen binding activity. 

The advent of genetic engineering techniques has opened up 
the possibility of engineering antibodies, but since a different gene 
codes for each of the polypeptide chains of an antibody it is 
necessary to clone and express a gene for each polypeptide chain. 
These genes may be either expressed in separate hosts, in which case 
the resiilting polypeptide chains from each host must be allowed to 
refold in solution and combine, or the genes may be expressed in the 
same host simultaneously in which case folding and association into 
the native structure with biologic«a activity can occur after 
e3(pression. The need to achieve e^qiression . of multiple genes, whether 
simultaneously in the same host or in mtatiple and different hosts, 
has proved to be problematic and inefficient. The intact of genetic 
engineering on monoclonal antibodies has been reviewed by Winter & 
Milstein in Nature, 349 . 1991, 293-299. 

The Fv fragment represents a known antigen recognition' 
factor comprising only about 220 residues out of approximately 1500 
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residues in the intact: antibody, it still comprises two polypeptide 
chains. A single chain F^-like molecule has,heen described in US 
patent 4,946,778 of Genex Corporation in which a single peptide linker 
sec[uence foreign to the native Fv fragment is introduced to connect 
the C-terminus of one chain to the N terminus of the other chain. 
Three such constructs (TRY59, TRYSl & TRY104B) having 234-250 amino 
acids were made and tested for binding. Numerous references in Genex 
are made to the fact that loss of residues from the native protein is 
to be minimised (e.g, col. 7, lines 23-24; col. 16, lines 45-47, 57) 
presxuneably fearing that loss of sequence would lead to loss of 
activity and/or stability. However on occasions when modelling a 
single linker does not appear feasible a 2 linker construct is 
suggested. Note the 2 linker approach is only suggested when the one 
linker approach does not work (col, 7, lines 28-29) . One such 2 
linker construct (TRY 40) having 225 amino acids was prepared and 
tested and again the reader is taught to minimise the loss of native 
protein sequences (col. 27, line 6) . An error has been noted in 
Genex 's description of construct TRY 40. The sequence listed in col. 
33, lines 63-64 of Genex is not consistent with the full listing of 
the construct in Pig. 24 therein. The 2 linker construct TRY 40 
described in Genex has lost residues I«41-43 and H105-109 from the 
strands; thus destroying 3 inter-strand H- bonds and one fi- bulge. The 
disruption of the strands in the centrsa S-barrel is problematic since 
it destroys part of the key element for maintaining a robust structure 
in the variable region. 

The present invention is based on recognising that the 
minimal elements necessary to retain good antigen binding affinity, 
the conplementcurity determining region (CDRs) and the central beta 
barrel structure for the Fv fragment, may be retained positioned 
appropriately in space by cyclic permutation of the central beta 
barrel structure and leaving the strands of the S barrel intact. The 
present invention is also independently based on recognising that 
after cyclic permutation of the central beta barrel the framework 
region of the Fv can be pared down through creation of novel linkages 
in the structure. 

According to one aspect of the present invention there is 
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provided a ligand binding variable domain (V-min) comprising: 

(a) a cyclically permuted central S-barrel having intact strands; 

(b) outer S- sheet segments; 

(c) complementarity determining regions (CDRs) ; and 

(d) linker sections for linking integers (a) , (b) & (c) into a v-min, 
whereby said central S-barrel, optionally in combination with said 
outer S-sheet segments > orients said CDRs in space for binding ligand. 

The term "central fi-barrel" means the 8 stranded cylindrical 
barrel formed (at the VL-VH interface) by packing of two fi-sheets (see 
Figure 3 in Novotny et al., 1983, Journal of Biological Chemistry, 
258, 14433-14437) . Note that Figure 3 in Novotny denotes the barxel 
as 9 stranded compared with 8 strands in our model (see Figure ij but 
the inclusion of an extra strand (5 strands from the heavy chain 
rather than 4) is in our view incorrect due to it not representing a 
true strand because of insufficient H-bonding to any of the 8 
principal strands. Elements of the extra strand in Novotny are more 
correctly denoted as outer 6-sheet segments (see below). Preferably 
the linker sections provide covalent linking. 

The term "cyclically permuted central fi-barrel" means that 
the linear amino acid sequence order of appearance of strands in the 
central IS- barrel is permuted compared with the linear amino acid 
sequence order of appearance of strands in a parent central S- barrel 
of an antibody. 

The term "having intact strands" means that the individual 
strands are mostly con^jlete and undisrupted whereby interactions such 
as H-bonds between strands of the central E-barrel are generally 
maintained despite cyclic permutation of the barrel. The following 
(canonically numbered) sequences define the strands: L33-40; L41-50, 
L84-90, ti97-104; H33-41; H42-52; HB8-95 and H102-109. Preferably at 
most one inter- strand H-bond is lost and especially no inter-strand 
H-bonds are lost. Conveniently no S-bulges are lost from the strands - 
Preferably at most 3 a min o acids are lost from the strands, more 
preferably at most 2 amino acids are lost from the strands, more 
preferably at most 1 amino acid is lost from the strands and 
especially the strands are completely intact. 
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Tlie term "outer fi-sheet segments" means segments of fi- sheet, 
not being part of the central 6-barrel of an. antibody, which 
participate with the central fi-barrel in providing a framework for 
orienting CDRs for binding ligand. 

The term "CDRs" means the hypervariable polypeptide loops 
which together determine ligand binding specificity and affinity; said 
CDRs are attached to a central S-barrel and/or outer IS-sheet segments. 
Preferably there are 4-6 CDRs, more preferably 5-6 CDRs and especially 
there are 6 CDRs. Preferably the V-min variable domain is derived 
from an antibody although other ligand binding members of the 
immunoglobulin superfamily such as T-cell receptors are also 
contemplated. 

Preferably the CDRs substantially reproduce the spatial 
orientation of CDRs from a parent antibody and substantially retain 
the ligand binding specificity and affinity of the parent antibody. 

The term "linker sections" includes sequences from any 
protein (for example foreign sequences) such as for example those on 
the Brookhaven Protein Data Bank (abbreviated PDB herein: the relevant 
address is Protein Data Bank, Chemistry Department, Building 555, 
Brookhaven National LeODoratory, Upton NY11973, USA and; a journal 
reference is Bernstein, F.C. et al . J. Mol. Biol., (1977) 112 , 
535-542). 

The term "ligend binding" includes antigen -antibody type 
binding recognition and also ligand binding in the sense of catalytic 
functions of antibodies (see Jacobs, iJ.W. in Bio/Technology 5^: 1991, 
258-262; and Tramontano, A. in Methods in Enzymology ^Tg., 531-568) . 

Preferably in the V-min the framework integers (a) , (b) and 
(d) per se have ^xpto about 180 amino acids. The framework region is 
defined as any part of the variable domaim other than the CDRs . 

Preferably the central fi-barrel and outer fi-sheet segments 
are from the same antibody (i.e. parent antibody) vrtiich may or may not 
be the parent antibody from which the CDRs are derived. In some 
circxjmstances it is advantageous to reduce immunogenicity, for exan^le 
for therapeutic use in humans. To date human antibodies have proved 
difficult to produce for therapeutic use, therefore alternative 
approaches have been sought. One such approach is to "humanise" 
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antibodies of non-human origin. For example CDRs from a moiise parent 
antibody may be grafted onto a hiiman parent ^tibody framework region 
using known techniques (see EP 239400) thus reducing antigenicity in 
humans compared with the complete mouse parent. Another exaxnple is to 
graft a complete mouse variable region onto a human constant domain 
(see US 4 8X6 567) . Another option is "antibody veneering" where 
surface residues of an antibody are altered to mimic a human antibody 
(George Mark presentation at 2nd Annual IBC Conference on Antibody 
Engineering, December 16-18, 1991, San Diego, USA and: EP519596 
Merck/NXH, Padlan et al) , 

According to another aspect of the present invention there 
is provided a ligand bindi n g variable domain (V-min) comprising: 

A) a framework region comprisdLng: 

(a) a cyclically permuted central S-barrel; 

(b) outer beta-sheet segments and; 
<c) linker sections; and 

B) complementarity determining regions (CDRs) ; 

whereby said central fi-barrel, optionally in combination with said 
outer S- sheet segments, orients said CDRs in space for binding ligand; 
the linker sections provide linking between A) & B) to form the V-min, 
in which the framework region per se has upto about 180 amino acids. 
Preferably the central beta barrel has intact strands. 

Preferably the V-min possesses an affinity for ligand of at 
at least 10E-5M; more preferably the V-min possesses an affinity for 
ligand of at least 10E-6M; more preferably the V-min possesses an 
affinity for ligand of at least lOE-TM, more preferably the V-min 
possesses an affinity for ligand of at least 10E-8M, more preferably 
the V-min possesses an affinity for ligand of at least 10B-9M, more 
preferably the V-min possesses an affinity for ligand of at least 
lOE-lOM and especially the V-min possesses an affinity for ligand of 
at least lOE-llM. In general, when a V-min is based on a parent 
antibody, the V-min preferably has an affinity for ligand at least 
about the same as that of the parent antibody. Modification of V-min 
to give increased affinity compared with a parent cuitibody is 
conten^lated . 

Preferably the V-min has less than 2 covalent linkages 
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between the heavy and light chains , 

Preferably there is provided a v-rain as hereinbefore 
described in the form of one continuous polypeptide. This aspect 
simplifies recombinant production of the V-min. However chemical 
synthesis of V-mins is also contemplated. 

Preferably there is provided a V-min as hereinbefore 
described comprising the following core segments: light chain residues 
21-55; light chain residues 84-104; heavy chain residues 21-60 and; 
heavy chain residues 88-109. This aspect has* the advantage of 
preserving the tight fi-ttims in the heavy and light nhaiiis around 
residue 40. 

Pref ereQjly there is provided a v-min as hereinbefore 
described wherein the linker sections essentially consist of sequences 
from the antibody used as the source of the central fi-barrel. This 
aspect has the advantage of substantially avoiding the introduction of 
foreign sequence into the V-mdLn. Thus the modelling process is 
simplified since conformation of foreign linkers does not have to be 
modelled; fiirtherraore iraraunogenicity is reduced in the host species 
used to provide the auitibody framework. 

Preferably there is provided a V-min as hereinbefore 
described, having less than about 180 amino acids in the framework 
regions. Preferably the V-rain has less than about 175 amino acids, in 
the framework regions, more preferably the V-min has less than about 
170 amino acids in the framework regions, more preferably the V-min 
has less than about 165 amino acids in the frsuaework regions, more 
preferably the V-mdLn has less than about 160 amino acids in the 
framework regions, more preferably the V-min has less than about 150 
amino acids in the framework regions, more preferably the V-min has 
less than about 140 amino acids in the fraunework regions, more 
preferably the V-min is less than about 130 amino acids in the 
framework regions and especially the v-min has less than about 120 
amino acids in the framework regions . 

Chothia C. and Lesk A.M. in Journal of Molecular Biology, 1987, 196, 
901-917 define the canonical CDRs as: 
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1*26-32 L50-52 LSl-SS 
H26-32 H53-55 H96-101 

which means that the canonical Framework regions are: 

Ll-25 L33-49 L53-50 L97-107 Fv 91 residues 

Hl-25. H33-52 H56-95 H102-112 Fv 96 residues 

total 187 residues 

This compares with one exemplified canonical V-min framework described 
in the examples set out below: 



H87-95 H102-109 H15-25 H33-52 H56.H64 LI.7 L21-25 L33-49 L53.63 1.79- 



90 



11 20 9 7 5 17 11 12 



L97-104 



total 117 residues 

i.e. a reduction of 37% (calculated from a reduction of 70 residues 
from the original 187 residues) . Molecular Design and Modelling 
concepts and Applications are described in Methods in Enzymology, 
volumes 202 and 203, Academic Press. 

Minimally sized antibodies are desirable for certain 
applications such as for example where improved tissue penetration or 
fast clearance from the body is iu5>ortant (Friedman, p,n. et al.. 
Cancer Research 53, 334-339: 1993). Inproved tissue penetration is 
in^rtant in imaging applications and fast clearance from the body is 
important in clearance of toxic drugs from the circulation and in 
antibody directed enzyme prodrug therapy. Ward et, ^ in Nature, 341, 
1989, 544-546 have described the binding activity of single VH domains 
called dAbs. DAbs have the disadvantage of only being able to present 
3 CDRs and exposure of hydrophobic amino acid residues to polar 
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environments gives rise to handling problems such as non-specific 
binding . 

Further advantages of size minimal is at ion are that simpler 
structxires may lend themselves to more specific chemistry or a wider 
range of chemistries during for example conjugation and the 
simplest /most compact structure may be more amenable to 
ollgomerisation of a series of binding elements either in a homo or a 
hetero oligomer. 

According to another aspect of the present invention there 
is provided a multivalent ligand binding moiety comprising at least 
one v-min as hereinbefore described. 

The term "multivalent" means that one mole of binding moiety 
can -bind mor^ than one mole of ligand; said ligand comprising one 
ligand (i.e. monospecific) or more than one ligand (e.g. bispecific) . 
Preferably the multivalent ligand binding moiety is bivalent. For 
example one mole of bivalent ligand binding moiety can bind 2 moles of 
ligand and comprises either 2 ligand binding V-min domains of the 
present invention or one ligand binding V-min domain of the present 
invention and one other ligand binding moiety such as Fab, Pv or 
single chain Fv. 

According to another aspect of the present invention there 
is provided a conjugate comprising a V-min as hereinbefore described 
or a multivalent ligand binding moiety defined as hereinbefore 
described and an effector molecule. 

The term "effector molecule" refers to a molecule capable of 
producing an effect or a signal. Preferably the effector molecule is 
an enzyme, cataaytic antibody, toxin, drug or radionuclide and 
especially the effector molecule is an enzyme or catalytic antibody 
for antibody directed enzyme prodrug therapy (1) , The effector 
molecule may be covalently and/or non-covalently linked to the V-min. 
Known techniques for covalent linkage include heterobifunctional 
linkers and protein fusion. A known technique for non- covalent 
linkage is via leucine zipper motifs and the like (Kostelny, S.A. e^ 
al-/ J- Immunol. 148 1992, 1547 - 1553 and Pack, P. et al . . 
Biochemistry 3i 1992, 1579-1584) . In some cases a mixture of both 
covalent and non-covalent linkage is used. For example a polylysine 
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moiety may be covalently linked to a V-min and the positive charge on 
Che poiylysine moiety may non- covalently link a complementary charged 
moiety such as nucleic acid. Another exatrple is covalent linkage of a 
chelate to a V-min; the chelate then non- covalently binding a moiety 
such as a radionuclide. 

According to another aspect of the present invention there 
is provided a polynucleotide sequence encoding a V-min as hereinbefore 
described or a multivalent ligand binding moiety as hereinbefore 
described. 

According to another aspect of the present invention there 
is provided an expression vector comprising a polynucleotide sequence 
as hereinbefore described. The expression vector may direct secretion 
of translated polypeptide chain for facilitating folding of the chain 
into a ligand binding conformation. Conveniently the expression 
. vector directs expression of translated polypeptide for intracellular 
acctmiulation, especially as inclusion bodies. 

According to another aspect of the present invention there 
is provided a host cell comprising an expression vector as 
hereinbefore described. Preferably the host cell is procaryotic and 
especially the host cell is E. coll . 

According to another aspect of the present invention there 
is provided a method of producing a V-min as hereinbefore described or 
a multivalent ligand b in di n g moiety as hereinbefore described 
comprising culture of a host cell as hereinbefore described. 
Methods for production and analysis of single chain antibody molecules 
as well as general antibody engineering techniques have been set out 
in the following publications : 

1. BEDZYK, W.D.: J. Biol . Chem. , (1990), 18615-18620 

2. HUSTON, J.S.; Proc. Natl. Acad. Sci., (1988), 85, 5879-5883 

3. BIRD, R.E.: Science, 242, (1988), 423-426 

4. WHITLOW, M. ; Methods, A Cott^anion to Methods in Enzymology, 2(2), 
(1991) , 97-105 
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5. WARD, E.s. : Antibody Engineering, A Practical Guide, C.A.K. 
Borraebaeck (Ed), WiH. Freeman New York (1992), Ch 6 : 121-137 

6. ANAND, N.N.: J. Biol, Chem. , 266(32) , (1991) , 21874-21879 

7. SKERRA, A.: Current Opinion in Immunology, 5, (1993), 256-262 

8. PLUCKTHUN, A.: Bio/technology, 9, (1991), 545-551 

9- GLOCKSHOBER, R. : Biochemistry, 29, (1990), 1362-1367 

10* SKBRRA A. : Bio/ technology, 9 (1991) 273-278 

11. PLUCKTHUN,^ A.: Nature, 342, (1990), 497-498 

12. FORD, CP.: Protein Expression & Purification, 2, (1991), 95-107 

13. PLUCKTHUN. A.: Immunological Reviews, 130 , (1992), 151-188 

14. HOIiLINGER, P.: Current Opinion in Biotechnology, 4 (1993), 
446-449 

15. BRINKMANN, U: Proc. Natl. Acad. Sci. USA, 90 (1993), 7538-7542. 

16. Methods in Enzymology, vol 178. 

Analogous methodology can be applied to v-mins described 
herein. Inspection of databases has indicated over 100 pxiblications 
at present on single chain antibody molecxiles covering many different 
antibodies thus indicating the general applicability of modelling 
concepts across this family of molecules. 

According to another aspect of the present invention there 
is provided a pharmaceutical composition comprising a conjugate as 
hereinbefore described an active ingredient in association with a 
pharmaceutical excipient or carrier. Conveniently the con^osition 
con^rises an effective amount of the active ingredient. 
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According to another aspect of the present invention there 
is provided a radioimaging composition comprising a conjugate as 
hereinbefore described as cux radioimaging ingredient in association 
with a pharmaceutical excipient or carrier. Conveniently the 
composition comprises an effective amount of the radioimaging 
ingredient. 

The reader is referred to Comprehensive Medicinal Chemistry, 
Volume 5 (Biopharmaceutics) , Hansch (Editor), Pergamon Press 1990; 
especially chapters 25.2 (formulation), 25.3 (Dosage Regimes) £uid 25.5 
(Drug Targetting) . Section 25. 5. 5.1 therein discusses immunotoxins 
and Section 25.5.5.3 discusses antibody- radionuclide conjugates for 
imaging cUid radiotherapeutic purposes. Radioimaging is described in 
Larson (1987) Radiotherapy 1£5, 297. A reference work on toxins is 
"Genetically Engineered Toxins" edited by A.E. Frankel, Marcel Dekter 
Xnc . 1992 . Imnrunotoxin Prepciration is discussed in chapters 20 and 21 
of Methods in Enzymology, Vol 93, Academic Press 1983. 

According to another aspect of the present invention there 
is provided a method of treatment of a tumo\ir comprising the 
administration to a humem or animal in need of such treatment of a 
pharmaceutically effective amount of a conjugate as hereinbefore 
described. 

According to another aspect of the present invention there 
is provided an immEunotoxin pharmaceutical composition comprising a 
V-min as hereinberore defined in association with a pharmaceutical 
excipient or carrier. 

According to another aspect of the present invention there 
is provided a radioimaging composition comprising a V-min as 
hereinbef orie defined in association with a pharmaceutical excipient or 
carrier . 

According to another aspect of the present invention there 
is provided an antibody- enzyme pharmaceutical con^osition for ADEPT 
comprising a V-min as hereinbefore defined. Many cytotoxic coii^ouhds 
have been discovered which are of potential use in Cemcer 
chemotherapy- Nitrogen mustards form^ one important family of such 
cytotoxic compounds. The clinical use of cytotoxic compounds in 
general and nitrogen mustards in particular has been limited because 
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of tlie poor selectivity in the cytotoxic effect between tumour cells 
and normal cells. One approach to overcome this problem has involved 
the development of so-called pro-drugs which are derivatives of the 
cytotoxic drug, often a relatively simple derivative, whose cytotoxic 
properties are considerably reduced compared to those of the parent 
drug . Proposals have been made for the administration of such 
pro-drugs to patients under regimes whereby the pro-drug is only 
converted to the cytotoxic drug in the region of the intended site of 
action. One approach involves linkage of a cytotoxic parent nitrogen 
mustard with an amino acid to form a pro-drug which can be converted 
to the parent nitrogen mustard at the site of intended action under 
the influence of an enzyme. This approach can be put into practise by 
the utilisation of an antibody/ enzyme conjugate in association with a 
pro-drug. The antibody/ enzyme conjugate is formed from an antibody 
selective for tumours and an enzyme that will convert the pro-drug to 
the cytotoxic drug. In clinical practice, the antibody/ enzyme 
conjugate is first administered to the patient and is allowed to bind 
to the tumour. After a suitable period of time, to allow clearance of 
the antibody/enzyme conjugate from the rest of the body, the pro-drug 
is administered to the patient. Conversion of the pro-drug, under the 
influence of the localised enzyme, to the cytotoxic drug takes place 
mainly in the region of the tumour. Such a system (Antibody Directed 
Enzyme Prodrug Therapy or ADEPT) is described in International 
Application PCT/GB8 8/0 0181 published as WO88/07378 and US Patent No 
4,975,278. 

According to another aspect of the present invention there 
is provided a method of treatment of a tumour comprising the 
ad m inistration to a human or animal in need of such treatment of a 
pharmaceutically effective amount of a pharmaceutical composition for 
ADEPT or an immunotoxin pharmaceutical composition. 

According to another aspect of the present invention there 
is provided use of a radioimaging composition as hereinbefore defined 
to produce a radioimage as an aid to diagnosis of a condition in a 
human or animal. 

The invention is illustrated, but not limited, by the 
following examples in which: 
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Figure 1 illustrates the Beta Strand Arrangement of Immunoglobulin 
Variable Regions in which structurally important residues Cysteines 
(L23-L88, H22-H92) indicated by C and Tryptophans 135 and H36 are 
represented by w. The upper section illustrates the outer S-sheets 
for each chain. These sheets overlay the inner sheets shown in the 
lower section of the diagram. Folding along the horizontal crease 
yields the £*sandwiches for the two chains . Juxtaposition of the 
inner sheets (fold along the vertical crease) generates the central 
15-barrel (highlighted). Folditig along the horizontal crease and then 
. rolling the sides of the paper to meet each other generates the inner 
central barrel surroiaided by outer strands etc. 

The dotted lines connecting the strands represent hydrogen bonding 
between strands; straight dotted lines are H-bonds formed between 
antiparallel strands and wavy dotted lines are H-bonds formed between 
parallel strands. 

Figure 2 illustrates the core segments as emboldened sequence. 

Figtxre 3 illustrates the Beta Strand Arrangement of Immunoglobulin 
Variable Regions the V-min Construct 4ra in which: 

I fi- strands above paper 

: fi- strands behind paper 

. vertical linkers 

- horizont2il linkers 

# non- sequential links for V-min 

The emboldened sequence is the V-min construct. 

Figtire 4 illustrates alignment of FAB light rfHa jri ff for F19,9 & IMCP 

Complementarity DetermdLning Loops are underlined. 
Canonical Structure Type (reference 4) is indicated alphabetically 
Framework residues important to Loop conformations are indicated by 

« 

Segments used in the V-min design are demarcated by < >. 

The order of appearance in the V-inin sequence is indicated by s#. 
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Figixre 5 lists: (a) amino acid sequence for F19.9 v-min constructs ; 
(b) amino acid sequence for IMCP V-min. 

Figure 6 lists the initial nucleotide sequence for F19,9 V-min. 

Figure 7 illustrates a V-min construct in bold line with redundant 
sequence in faint line. 

Figure e shows the of-carbon diagram of a final V-min structure. 

Figure 9 shows a V-min compared, with a Fv structure. Faint line on 
the Fv represents sequence discarded to form the V-min. The bottom 
left comer of the V-min represents TAG sequence added. 

Figures 10, lOA & 11 show that the environment of possible problematic 
polar groups in V-min structures have adequate bonding. Figures 10 
and lOA show the environments for residues GluS5 and Lys97 in F19.9 
V-min respectively. Figure 11 shows the environments for Lysioi and 
GlnlOS in IMCP V-min. 

Figure 12 is a diagrammatic representation of Pv, central S barrel and 
minijnal V-min constructs. 

Figure 13 illustrates the V^ £-strand arrangement. 
Figure 14 illustrates the V^ S-strand arrangement. 

Figure 15 illustrates the Fv fi-sandwich arrangement with the heavy 
chain in continuous line and the light chain in broken line. 

Figxxre 16 illustrates a conservative V-min construct including 
redunckant-segments . 

Figure 17 illustrates the conservative V-rain construct of Fig 16 but 
excluding redundant segments . 

Figure 18 illxistrates a minimal V-min construct including redundant 
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segments . 

Figure 19 illustrates the minimal V-min construct of Fig 18 but 
excluding redundant segments . 

Figure 20 illustrates construction of a MCP603 V-min gene from 
oligonucleotides . 

Figure 21 is the peptide sequence of a MCP603 V-min construct 
including c-myc peptide tag. 

Figure 22 illustrates pBluescript. 

Figure 23 illustrates pMIN603. 

Figure 24 illustrates construction of pICI0020. 
Figure 25 illustrates a trp promoter . 
Figure 26 illustrates construction of pTB344. 
Figure 27/27A illustrates construction of pICI0042. 
Figure 28 is a transcription terminator sequence. 
Figure 29 illustrates construction of pLB013. 
Figure 30 illustrates pICI123. 
Figure 31 illustrates pICI1079. 

Figure 32 illustrates construction of pICI1572 . 

Figure 33 illustrates construction of a F19.9 V-min gene using 
oligonucleotides. 
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Figure 34 illustrates a F19.9 v-min peptide 



sequence. 



Figure 35 shows the sequence of pSWl-VHD1.3-VKD1.3-TAGl from the start 
of the first pelB leader sequence to approximately 50 bases beyond the 
end of the TAGl. Numbered from the start of the pelB. Amino acid 
sequence of the pelB-VHDl.3 and pelB-VKDl, 3-TAGl genes are shown below 
the DNA sequence. 

Figure 36 shows PGR generation of DX.3 V-min 

Figure 37 shows products from first round PGR reactions 

Figure 38 shows a plasmid map of pIGI226 

Figure 35 shows a plasmid map of pJFH3037 

Figure 40 shows D1.3 V-min conservative 

Figure 41 shows h65-115/hl3-64/ll-108 D1.3 V-min with Intact Light 
Ghain 

Figure 42 shows 161-ip7/ll3-60/hl-116 D1.3 V-min with Intact Heavy 
• Chain 

Figure 43 shows h87-ll5/hl3-64/ll-7/12X-61/l76-l08 Dl . 3 V-min 
minimal Construct 

Figure 44 shows h67-i25/m3-66/ll-ll2 KOL V-mia with Intact Light 
Chain. 



Figure 45 shows 162-i09/112-61/hl-125 KOL V-min with Intact Heavy 
Chain 



Figure 46 shows h66-ll8/hl3-fi5/ll-ii2 PAC-1 V-min with Intact Light 
Chain 
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Figure 47 shows 166-lll/112-65/hl-123 PAC-1 V-min with Intact Heavy 
Chain 

Figure 48 shows binding data for a Dl,3 V-min 

Table 1 lists Cor-Ca distances for linker sections of the MPC Fv 
fragment 

a) I»55-84 and Iil04-107 versus Ll-21 

b) H60-88 versus Ll-21 

c) , Heo-88 and HlOB-113 versus Hl-21 

d) L55-84 versus Hl-21 

e) L5 5 -84. versus Li04-107 and H60-88 versus H108-113 

f) Intra Section relationships 

Table 2 lists oligonucleotides used in gene assembly 

BXAMPLg 1 kodbxj:.zng studies 

v-min proteins have been designed, modelled and analysed. 
One is based on a known (2) FAB X-ray structure, IMCP (also referred 
to herein as McPC603) , a mouse monoclonal which binds phosphatidyl 
choline. Another is based on F19,9 which lacJcs an experimental 
structure for the FAB fragment. We have therefore generated a FAB 
model for F19.9 by analogy with the known structure of IMCP. This 
homology modelling approach (3) is a well established method for 
generating structural models of proteins that exhibit more than 30%, 
identity to a protein whose 3D structure has been determined 
experimentally. The methodology set out below can be applied to any 
antibody suid was used to generate the V-min structures in the examples 
herein . 

1.1 Background 

The variable Fv region of the FAB structure forms the basis for 
designing the V-min structure and, in the general case, a structural 
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model of the Pv region is generated by homology modelling based on the 
most closely analogous known structure. 

AS indicated earlier, the central eight - s tranded fi- barrel of the Fv 
structure is retained in the v-min design. In the Fv structure, four 
fi- strands . are contributed by the light chain, and four by the heavy 
chain. 

Fovu: of the six conqplementarity determining regions (CDR's) appear as 
apical loops connecting these S-strands (L3, L2, H3 & H2 in Fig l) . 
The remaining two CDR's connect the N-terminal segments of each chain 
to. the first barrel fi-strand in that chain (LX & HI in Fig l) . 
The arrangement of fi- strands in the variable regions of immunoglobulin 
light and heavy chains is shown in Figure 1 using the canonical 
numbering of Chothia and Lesk <4) . 

In Figure 1, the upper section illustrates the outer fi-sheets for each 
chain. These sheets overlay the inner sheets shown in the lower 
section of the diagram. Folding along the horizontal crease (dotted 
line) yields the S-sandwiches for the two chaiixs. 

Juxtaposition of the inner sheets (fold along the vertical crease) 
generates the central C-barrel (highlighted) , 

Figures 13, 14, and 15 also illustrate the arrangement of fi-strands. 
1.2 Tho Minimal Recognition Structure 

In addition to the central fi-barrel, four segments of fi-strand 
(L19-25, Ii53-55, H18-25, H56-60) are required for the generation of 
CDR's-I and II in both chains. 

Residues on either side, in sequence terms, of the CDRs are anchored 
in the central fi-barrel. These three segments per chain are defined 
below : 



Light Chain 21-3 9, 42-55, 84-104 
Heavy Chain 20-40, 43-60, 88-109 

In order to avoid disrv^ting the two tight Stums at L3 9-1,42 and 
H40-H43 we have adopted the following four core segaont model (s^ 
Figure 2) : 
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Two core seowents from tJie Light Chain 

L21-55 ie light chain residues 21-55 (contains CDR's LI- and LII) 
L84-104 ie light chain residues 84-104 (contains CDR LXII) 

Two core secmients from the Heavy rha-iji 

H21-60 ie heavy chain residues 21-60 (contains CDR's HI and HII) 
H88-109 ie heavy chain residues 88-109 (contains CDR HIII) 

^■•3 Single Chain Constructs 

In order to produce a single chain species, these four core segments 
are linked together using spatially appropriate sections from the 
intact structure. These linker sections are indicated below (NB lower 
case is used to differentiate linker sections from the core segments) . 



Section 


Chain 


Rosidue Numbera 


11-20 


Light 


1-20 


156-B3 


Light 


56-83 


1Z05-107 


Light 


105-107 


hl-20 


Heavy- 


1-20 


h61-87 


Heavy 


61-S7 


11109-113 


Heavy 


109-113 



(NB non-natxiral, that is foreign, linker sections could also be 
designed to do this such as for exan^^le using sequences extracted from 
the Brookhaven data bank as outlined in Genex DS4946778) * 

The cyclic nature of the central S-barrel means that the four core 
segments can be combined in different orders . For example, by linking 
the C- terminus of a given segment to any one of the N- termini from the 
three other segments. There are thxis potentially 24 different 
combinations . Their feasibility can be determined by means of the 
spatial relationships of the available elements of the intact 
structure (ie the linker sections) that can be used to join the core 
segments. The adjacency of these linker sections can be evaluated in 
terms of the distances between the a-carbons of the residues ' involved 
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(see Table 1) . 

These distance tables can be vised to produce an adjacency matrix of 
possible linkages- If any a-carbon in one linker section is 
sufficiently close to any of suiother we define the sections as 
adjacent. The cut-off distance that we have used is 15A, since this 
allows for the reorientation of up to six residues in order to create 
a contiguous structiure. In the following table, + indicates sections 
that are adjacent by the sUsove definition, - indicates sections that 
are too distant to connect and 0 represents self with self. 

11-20 156-83 1105-107 - hl-20 h61-87 hl09-113 



11-20 

156-83 

1105-107 

hl-20 

h61-87 

hl09-113 



0 
+ 
+ 



0 



+ 
0 



0 
+ 
+ 



+ 
0 
+ 



+ 

+ 
0 



We can enumerate the possible combinations of linker segments via the 
adjacency diagram shown below, where double-headed arrows indicate two 
ad j acent sections . 



107-105 

/ \ 

/ \ 
1-20. 21-55 -56-83 



Light Chain 



I I 
I I 

V V 

87-61> 60-21 >20-l 

\ / 
68-108 / 

\ / 
109-113 



Core Segments are. underlined 



Heavy Chain 



In the following, we use example residues to illustrate possible 
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connections. Residues in the core segments (upper case) and linker 
sections (lower case) are separated by Wh^re segments or sections 
are separated by : the sequence is continuous as for the native 
protein. Where / is used a novel linkage between segments is formed. 

The lack of a direct connection between 1105-107 and hl-20 (or 
hl09-li3 and 11-20) means that the native sequence order can only be 
preserved by use of a foreign linker section ( 1 1 ) : - 

11-20 :L21-55:l56-83sL84-X04:1105-107/U/hl-20:H21.60:h61-87:H88.108:hl 
09-113 

or vice versa. 

Some of the proximal distances are non-productive in that they would 
connect a single segment in a closed fashion. For example; the 
distance between light chain residues 20 and 74 is 4A but this merely 
connects linker sections 156-83 to 11-20, thus enclosing the core 
segment L21-55. 



11 20:L21 55:156------- 83 

74 

Exploiting the adjacency diagram, in];e£-chain connections involve: 

11-20 <----> h61-87, or hl-20 < > 156-83 

and the following four combinations may be possible : 

1. L21-55 :l56-83 ;L84-104/ ll-20/h61-S7 rHflfl-lQS ;h-LQQ.T ^ t />.t -on /xr-y^ .^r^ 

2. H21-60:h61-87;H88-108/ hl-20/l56-83 *La4-lQ4.1i^og-Tn'7/n -Oft/T,?l-55 

3. L84-104:1105-107/lX-20;L21-55; l56-83/hl-2Q =H21-6Q=Hgi,B-7.WQQ-tnp 
hl09-113 

4. H8a-108 ;hl09-113/hl-20:H21-60r h61-87/ll-2Q :L21-55 = l qg,fl-^.T.ftA-inA , 
1105-107 
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m choosing specific residues from the linking sections to generate 
the connected sequences, we make use of the distance tables to 
identify candidates. 

In Ppnstruct l , residues from the linker sections 11-20 and h61-87 are 
to be used to connect core segments L84-104 and H88-108. Analysis of 
the distance tables suggests that light chain residue 1 can connect to 
heavy chain residue 61, and that light chain residue 104 could link to 
light chain residue 13. The two candidates from 11-20, ie 13 and l, 
also need to be connected. These two residues are connected in 
sequence 11-13, unfortunately, the direction would need to be reversed 
in this context. 

-— > — >87 

-> i 

> I I 

> I 

— 11-— < 13 20 Hsi-ioe 

sequence reverses 
This process reveals that constructs i aaS 2 are unlikely unless 
foreign linkers (II) are used. 

1. r<21-104/Lil/H/ia/H61-113/H13-S5 

2. H21-108/H13/U/H1/LS7-107/L13-60 

gonstructs 3 and 4 appear to be viable and represent generic 
constructs that produce single chain species in which the sequential 
order of the CDR's is changed and novel linkages using native residues 
are introduced. 

In construct 3^ the sequential order of the light chain is reorganised 
to provide a connection to the intact heavy chain, whilst in construct 
4 the heavy chain is reorganised and the light chain preserved. 
Choosing specific connecting residues by means of the distance tables 
yields the following exemplifications: 



3a. I.83-107/L13-57/H1-113 
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4a. H87-113/H13-64/L1-107 
Minimal Single Chain Constructs 

The distance tables indicate that constructs 3 and 4/ which consist of 
approximately 185 residues, may be reduced in size by: 

a) Omitting sequence that spans key segments which are close in space. 

Thus in construct 3, section hl-20:H21-60 can be replaced by 
hl-6/H2l-60 thereby saving fifteen residues. 

II 

I I < omitted secjuence 

I I 
I |20 

. . H21 €0 



hi 



b) Splicing those candidate jxinctions which minimise the number of 
residues needed. 

Thus in construct 4, hl09-ll3/hl3-20 can become hi09/hl5-20, a 
saving of six residues. 



hl09-- --113 

/ : 

20-< < hl3 

15 

Such considerations lead to the following minimal single chain 
constructs of about 150 residues in which similar numbeirs of residues 
are contributed by the originating chains - 

3m. L83-106/L15-S7/H1-6/H21-60/H88-113 
4m. H87-109/H15-64/I,l-7/L21-63/L79-104 

We how further illustrate the approach by .adopting construct 4m 
(Figure 3) for further modelling analysis. 

1.4 Alignment and V-min . constructs 
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The alignment of F19.9 with IMCP was performed within QUANTA (5) and 
topological consequences were checked using computer graphics 

The F19.9/1MCP alignment is shown in Figure 4 and the V-min constructs 
are as follows : 

si s2 S3 s4 55 

Canonical H87-H109/H15-H64/Ll-.I.7/IJ21-L63/L79-ia04 
IMCP H93-H118/H15-H67/ia-L7/L21-L69/L85-L110 
F19.9 H93-Hlll/H15-H67/ia-L7/L21-L63/L79-L104 

The restiltant amino acid sequences are shown in Figure 5 with 
associated C- terminal TAG residues (used to aid subsequent detection 
in binding assays) and N- terminal modifications which may result from 
cloning into certain expression vectors. An initial nucleotide 
sequence for F19.9 V-min is shown in Figure 6. 

In Figure 4, the V-min segments, and the CDR's are indicated. In 
addition, certain framework residues which Chothia suggests (4) are 
irapiortant to CDR loop conformations are identified. The V-min 
construct shown above incorporates five of these residues, however 
three are omitted. Of these, L64 Gly (IMCP L70) (in^jortant to CDR Lii) 
and L71 Tyr (IMCP L77 Phe) (important to CDR LI) could be restored by 
replacing s4 & s5 with L21-L104 (IMCP L21-L110) . 

Restoration of the remaining residue H71 Arg (H74 in F19.9 and IMCP) 

which is important to the juxtaposition of CDR's HI and HII could be 

achieved by initiating the sequence earlier than H87, eg at H68 (H71 

F19.9 and IMCP) . Alternatively, it may be possible to mutate eg H49 

(H52 Asn in F19.9 and IMCP) to reinforce the loop positions. 

Further illustration of the V-min concept is provided in Figures 12 - 

19. 



1.5 Kodelllng the V-nln Proteins 

1.5.1 Introduction 

The first step is to identify an x-ray structure of a Fab on which to 
base the model. In the case of IMCP the X-ray structure has been 
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determined. In general, the decision is based on the degree of 
identity between the seqtience. of known Fab structures and the target 
protein. Particular eii?>hasis is placed on the relationship of the 
corresponding CDR loop sequences and their associated conformational 
determinants from the framework (4) . Numerous Fab X-ray structures 
are available in the Brookhaven data base. The IMCP Fab structure 
represents a viable starting point for the development of a homology 
model of F19.9. The Fv portions of the Fab structures provide useful 
reference points for evaluating the feasibility of the V-min models in 
terms of the residue environments etc. 
1.S.2 IMCP V-min 

The regions of the IMCP structure which are not involved in the v-min 
construct were deleted using the QraUTA (5) utility. The remaining 
structure was output as a PDB file which was edited to the proposed 
V-min order. The resulting PDB file was imported into QTONTA as an 
initial V-min structure. 

this structure required remodelling of the junctions between segments 
to create a connected structure. This was undertaken using an 
annealing protocol which can be generalised as follows: 
Fix all atoms other than those in the annealing zone. 
The annealing zone consists of ca four to six residues (eg two on 
either side of the proposed -join") . The size of this annealing zone 
is reflected in the 15 k criterion for "adjacency" used earlier. 
Perform constrained optimisation using CHAHMM (5) until residual 
gradient is less than 1 Kcal/A 

Check for torsion angle violations (eg cis peptides etc). 

If necessary, incorporate appropriate dihedral angle constraints and 

re-optimise. 

After all junctions were annealed, the resulting structure was energy 
relaxed in two stages, firstly, all side chain atoms were allowed to 
relax and this was followed by a full energy relaxation. The 
disulphide distemces were constrained in these steps. 

At this stage, the »tag" sequence was inserted after the C-terminal 
residue. The "tag" sequence plus the last two V-min residues formed an 
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annealing zone and the protocol described earlier was followed. 
Additional distance constraints were applied, to bring the non-polar 
residues in the tag sequence into contact. 

A final full energy relaxation was applied and an a-carbon diagram is 
shown in Figure 8; a corresponding Figure 7 shows the V-min construct 
in bold and redundant sequence in faint line. 

1-5.3 F19.9 V-min 

The proposed amino acid sequence file for this construct + "tag" was 
generated and QUANTA utilities used to map the sequence onto the 
structure of IMCP V-min + "tag- generated earlier. As can be seen in 
Figure 4, two deleted sections with respect to IMCP occur in the CDR 
loops. Thus, two annealing steps were undertaken followed by energy 
relaxation of sidechains and, finally, of all atoms as described 
earlier for the IMCP V-min model. 

Two candidates for the N-terminal modification are indicated in Figure 
5. The shorter candidate requires a glycine to be replaced by leucine. 
Initial inspection of the V-min model indicated, that this could be 
problematic. Thus, the necessary mutations and insertions were 
accon?)lished using QtlANTA and a zone consisting of the candidate 
sequence plus the next residue was annealed followed by energy 
relaxation. 

1.5.4 Modelling The FAB Structure 

This model was required for con^arative analysis with the F19.9 V-min 
construct. The alignment shown in Figure 4 formed the basis for 
mapping the F19.9 sequence onto the IMCP structure using QUANTA. The 
deletion and insertion zones were annealed as described for the V-mins 
and the side chains were energy relaxed, followed by the full molecule 
also as described earlier. 

1.5.5 Analysis of The V-mln Models 

The following criteria have been applied in assessing feasibility of 
the models, .the residue numbering is that of the V-mins : 
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1.5.5 Potential Energies 

The energy of the system should indicate a l^ck of steric clash, and 
internal strain. In the CHARMM force field using a distance dependent 
dielectric (4r) , a general rule of thumb is that energies /residue 
should be in the range -4 ± i Kcal/mole. 

Potential Energy 
kc^il/TTiole kcal/reaidnA 
P19.9 V-min + tag ,556 ,2.6 

F19.9 V-min/ tag/N- term -543 .3,4 

IMCP V-min + tag sas ,4 ^ 

1 • 5 • 5 . 2 Dihedral angles 

Peptide omega angles (C'-N torsion) should be trans (I8O0 +/. 200). 
Cis peptides are allowed if they precede proline (7), 
Main chain phi. angles (N-Cof torsion) should be negative for L-amino 
acids. Positive values are acceptable for glycine residues (7). 



Positve 


Phi Angles 


P19.9 V-min 


IMCP V-min 


Gly 16 


Gly 23 


Gly 21 


Gly 28 


Gly 31 


Gly 38 


Asn 62 


Lys 69 positive 




Gin 100 + in IMCP 


Gly 99 


Gly 112. 


Arg 108 


Gly 121 + in IMCP 


Gly 115 


Gly 128 


Gly 142 


Gly 155 



Comment 



Cis Omega Angles 
£19^ V-min l^CP V-min Comment 
Phe 135 Pro 136 Tyr 148 Pro 149 cis in IMCP Xray 



wo 94/12625 



28 - 



PCT/GB93/02375 



Smssa angles outside the range 180^ +/- y^Q 



F19. 


9 V- 


mxiA w 


IMCP V- 


min w 


Comment 


ASD 


72 




Asp 


79 


159 


jiinction s2-s3 






-152 


Asp 


136 


-152 


jxinction s4-s5 


Leu 


130 


159 


Gin 


143 


159 




Lys 


144 


153 


Lys 


157 


154 


junction s5-tag 


Glu 


146 


156 


Glu 


159 


156 


The following are 


lie 


150 


154 


He 


163 


154 


residues from the 


Ser 


151 


154 


Ser 


164 


154 


"tag" sequence and 


Glu 


153 


152 


Glu 


166 


152 


less well modelled 



1 • 5 • 5 • 3 Bydrogen Bonds 

For folded proteins main chain hydrogen bonds occur at an average 
frequency of 60 ± 20 for every 100 possible (8) . 



F19.9 V-rain 
IMCP V-min 
IMCP Fab (Xray) 
F19.9 Fab 



H- Bonds /; i^OQ 
All Main Chain iS- sheet 
55 22 
57 25 
59 31 
63 29 



Turns 
22 
21 
15 
18 



1.5.5.4 fixposad Non-Polar residues 

in general the surface accessibility of non-polar residues should be 
less than half of that of a Gly-X-Qly tripeptide in the extended 
conformation (9) . in addition, free energies of hydration for residues 
should be negative or zero. 

For the present purpose, we report non-polar residues that are exposed 
and have positive hydration free energies (10) , and which also differ 
significantly from the parent Fab model. 
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F19 


.9 


Accessible Fraction 


Hvdration 


Enerxrv 


(kcal /mol f» ) 






V-min 


Fab 


D 


V-min 


Fab 


D 




Val 


19 


0,1 


0.0 


0.1 


0.6 


0.0 


0.6 


junction sl-s2 


Val 


53 


0.2 


0.0 


0.2 


0.3 


0.0 


0.3 




lie 


56 


0.4 


0,0 


0.4 


0.3 


0.0 


0.3 




Val 


71 


0.5 


0.0 


0.5 


0.7 


0.0 


0.7 


jtmotlon s2-s3 


Xle 


79 


0.8 


0.0 


0.8 


0.1 


0.0 


0.1 


junction 83-84 


He 


106 


0.2 


0.0 


0.2 


0.2 


0,0 


0.2 





Accessible Fractip;^ Hvdration Free Energy (kcal /mole ) 







V-min 


Fab 


2 


V-min 


Fab 


2 




Val 


19 


0.5 


0.1 


0.4 


0.7 


0.2 


0,5 


?? 


Val 


26 


0.1 


P.O 


0.1 


0.6 


0.0 


0.6 


junction 8l-s2 


Val 


78 


0.6 


0.0 


0.6 


0.8 


0.0 


0.8 


junction b2-s3 


X*eu 


95 


0.5 


0.4 


0.1 


0.3 


0.1 


0.2 




Leu 


104 


0.1 


0.0 


0.1 


0,2 


0.2 


0.2 




He 


119 


0.2 


0.0 


0.2 


0.2 


0.0 


0.0 





1.5.5*5 Buried Polar Reaidues 

If the accessible surface area of 
(ie tbat of a water molecule) (8) , 
is positive, the environment must 
requirements of the residue (ii) , 



a polar residue is less than 10 A 
or if the free energy of hydration 
satisfy the hydrogen bonding 
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^3.9, 9 Accessible Area A" Hydration Free Energy ( kcal /mole ) 







V-man 


Fab 


D 


v-min 


Fab 


D 


Tnr 


7 


0 


0 


0 


0.0 


0.0 


0.0 


Asp 


40 


0 


0 


0 


0.0 


0.0 


0.0 


6lu 


55 


2. 


1 


1 


2.6 


-0,2 


2.8 


Ser 


92 


0 


1 


-1 


-0.1 


-0.1 


0.0 


Lys 


97 


56 


38 


18 


0.2 


-1.2 


1.4 


Ser 


101 


6 


9 


-3 


-0.6 


-0.6 


0.0 


Gin 


131 


2 


0 


2 


-0.1 


0.0 


-0.1 


Arg 


137 


5 


2 


3 


-0.5 


-0.1 


-0.4 


Thr 


138 


0 


21 


-21 


0.0 


0.2 


-0.2 


Thr 


143 


3 


0 


3 


-0.7 


0.0 


-0,7 


IMCP 




Accessibl 


e Area 


_ o 


Hydration Free 


Enercrv 






V-min 


Fab 


B 


V-min 


Fab 


D 


Asn 


B 


1 


0 


0 


0.0 


-0.1 


0.1 


Thr 


14 


0 


0 


0 


0.0 


0.0 


0.0 


Gin 


47 


0 


0 


0 


0.0 


0.0 


0.0 


Lys 


101 


54 


44 


10 


0.4 


0.0 


0.4 


Oln 


108 


9 


12 


-3 


-0.4 


-1.3 


0.9 


Gin 


143 


0 


0 


0 


0.0 


0.0 


0.0 


Asn 


144 


0 


0 


0 


0.0 


0.0 


0.0 


Asp 


145 


2 


2 


0 


-0.2 


-0.3 


0.1 


Thr 


156 


3 


0 


0 


0.0 


0.0 


0.0 



?7 



7? 
?7 



1.5*5.6 Summary of Analysis 

The potential energies and hydrogen bonding compositions of the V-min 
structures are in the expected range (8) . 

The only dihedral angle that* may require further investigation is 
F19.9 V-min Arg 108 0 which is positive and is a Gly in IMCP. 
For both V-min struct\ires, three non-polar residues are slightly more 
exposed than their Fah counterparts. These occur at junctions between 
sequence sections that have been connected in the modelling process. 
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There are also slight distortions in peptide.^ omega values in these 
regions. The deviations are relatively small and are iinliicely to 
present major structural problems. 

Similarly two polar residues have been identified in both V-min 
structures as possible problems, however inspection of their 
environments (Figures 10-11) suggests that the polar groups are 

adequately bonded. 

» 

In terms of these "modelling" criteria, both structtares appear to be 
feasible. 

^•^ Further V-mln coziBtruots 

i) The restoration of potentially important framework residues 
was alluded to earlier and conprises a four segment construct: 

F119.9 H71-111/H15-67/L1-7/L21-104 

ii) If a cysteine were to be required at the -base of the V-min in 
order to facilitate linJcage to an effector protein, the presence of 
two.disulphide bridges. (F19.9 6-27, 81-129) may coznplicate 
purification, refolding etc. The disulphides are relatively buried 
and in an extended conformation which suggest the use of valine or 
alanine replacements. 

iii) As indicated earlier, the core segments required to present 
the six CDRs can be chosen such that six core segments are used. This 
gives rise to additional combinatorial possibilities which can be 
evaluated as shown earlier for the four core segment approach, 

addition it is possible to consider the generation of 
human Fv frameworks based on genetic families or clans (13) . Mouse 
CDR's can then be grafted on taking into account the requirements for 
particiUar framework residues to control CDR loop conformation. The 
resultant Fv sequences can readily be converted to V-min constructs as 
shown earlier. 
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V) Minimal sized antigen combining moJ.ecules could be generated 

as two or more chains. The simplest such molecule would be of the type 

* 

Light Chain L83-106/L15-57 eg 67 canonical residues 

Heavy Chain H87-l09/H15-e4 eg 73 canonical residues for a total 

of 140. 



1.7 



Sequence for a consftrvatlva v-m i n basprt »n an«-^K»^y 



The vmin invention involves the generation of a single-chain Fv-like 
recognition protein by altering the order of appearance of sequence 
segments that display the CDRs. By judicious paring of non-essential 
segments, the size of the protein can be reduced to as few as ISO 
residues. For a conservative construct, the aim is to conserve more 
of the original Pv and reduce the new splice junctions, thus resulting 
in a larger protein {for example upto 210 residues). One advantage of 
the conservative approach is the retention of framework residues which 
may influence CDR loop conformation or participate in induced fit 
responses to binding, with these considerations in mind analysis of 
the x-ray structure for DX.3 Fab resulted in the following 
conservative Vmin construct :- 

H65-H115 ;H13-HS4 tLl-L108 

The light chain segmmt remains intact and there are two 
splice-junctions where remodelling is required. A loop, taken from 
pseudoazurin (2paz 34-39) served as a template for the first junction 
and two loops (leucine binding protein 2ibp 109-115. chymotrypsin 
inhibitor 2ci2 42-48) provided two possible conformations for the 
second junction. Thus two models have been generated, both are 
energetically and geometrically feasible and exactly correspond to the 
original Fv with the exception of the two splice- junction loops and 
the absent N- terminal residues from the heavy chain {H1-H12) . 
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In comparison with the Fv, more non-polar sxx^face is exposed <ca 
130A. ) , and much of this arises from the two leucines associated with 
the remodelled splice- junctions . There may be a case for replacing 
these leucines by alanine or serine to reduce the exposed non-polar 
surface (Figure 40) . Example 4 herein sets out expression of a Di.3 
V^min construct . 

In sequence number 54 there is indicated on a Dl,3 Vmin (conservative) 
an N- terminal secretion sequence (amino acids 1 to 22) as used by 
Winter et al (Nature 1985 341 544-546) in the successful cloning of Pv 
fragments used for crystallography (J. Mol. Biol. 1990 213 617-619). At 
the C-terminus there is included the c-myc tag sequence (amino acids 
234 to 244) used previously. The changes to the splice junction loops 
are also indicated, but not the possible replacement of the leucines. 



^XAXPtiB 2 GZNKRATIOK OF V-UIN EXPRESSION PIASMIDS 

Design of an McPC603 V-min Synthetic Gene 

We have designed an McPC603 V-min synthetic gene based on the protein 
sequence defined by the model. A major consideration was the use of 
E^jcoli preferred codbna but it is possible to include other features 
without compromising this objective. The gene was designed for 
intracellular expression in E.coli using expression vectors set out 
herein but has inherent f lexlbilty for cloning into other systems 
such as secretion syst«ns as set out in example 4 herein. Protease 
deficient strains of E. coli may be considered for expression. 

^•^•^ Prepgration of a v-min avntheti e gene from antibody MePCSQl 

A DNA sequence (Figure 20) encoding the amino -acid sequence of the 
polypeptide of Figure 21 Pi which is based on the modelled structure 
for V-min was designed with the following features : 
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Codons normally chosen as those mo^t coinmonly used in highlv 
expressed E.coli genes. 

A series of restriction endonuclease sequences flanking and 
throughout the gene to facilitate flexibilty in cloning and subsequent 
genetic manipulation. 

3) Translation initiation and termination codons. 

A sequence designed to minimise potential secondary 
structure within both component oligonucleotides and predicted 
messenger RNA, 

5) Single-stranded cohesive termini to allow ligation at 

suitable sites withdLn a plasmid. 

2-1-2 Sequence compilation. 

The amino-acid co-ordinates specified by the V-min model were used to 
compile the DNA sequences for the synthetic gene using existing dka 
sequence data. The DNA/amino-acid sequence data for the Fv fragment 
of MCPCS03 is published (Skerra et al, 1991, Bio/ Technology 9, 
P273-278) . The inclusion of a c-myc peptide tag (Ward et al, 1989, 
Nature 341, p544-546) at the C- terminus is usefxil for subsequent assay 
pxirposes. There are no antibodies available which would be capable of 
recognising the framework regions of a range of V-mins and any CDR 
specific antibodies would have restricted use and could interfere or 
be blocked in an assay. 



The verified sequence was then checked for the presence of codons 
rarely used in E.coli ( according to Anderason and Kurland, 1990, 
Microbiol. Rev. 54, pi98-2l0) and the relevant changes introduced. 
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2.1»3 Introduction of unique restriction aitea. 

To maximise the f lexdlbility of the gene we attempted the introduction 
of unique restriction sites flanking each CDR region. This will allow 
the subsequent replacement/shuf fling of the binding domains by a pair 
of complimentary oligonucleotides in the form of a restriction 
fragment. The premise was that the Y-min framewrk may be the basis 
on which to build subsequent molecuJ.es with altered/alternate binding 
specificities. This could be in the form of modified or completely 
different CDRs. Such changes could be tested by protein modelling 
prior to any e3q>erimentation. 

Using the MAP/SII*ENT programme in the University of Wisconsin GCG 
software package (Devereiix et al,l984, Nuc. Acids Res, 12 , p387-395, 
hereinafter refered to as "Wisconsin"), the V-min sequence was 
searched for potential restriction sites which cauXd be introduced by 
codon changes but without causing amino-acid substitutions . From the 
results of all potential restriction sites a choice of unique sites 
flanking the CDR regions, which do not compromise the E . coli codon 
preference, was made. 

Codon usage was again derived from this sequence and the relevant 
balance between the various codons as quoted in codon usage tables 
adjusted without compromising any of the introduced restriction sites. 

2.1.4 Cloning stratecnr. 

The phagemid vector pBluescript KS- (short et al, 1988, Nuc.Acids Res. 
16, p7583 and available from Stratagene) was chosen as the initial 
cloning vector to provide a permanent source of the gene. The 
conttplete nucleotide sequence of pBluescript KS- is available in the 
GenBank Genetic Sequence Data Bank, BilofsJcy et al, 1986, Nuc.Acids 
Res. pi-4, filename VecBase (3) rBlueKSm, accession number VB0077 

(Fig 22) . This vector has the advantages of high copy number for ease 
of plasmid p\arif ication, colour selection for recombinants and the 
ability to generate single -stranded DNA which is \iseful for 
introducing siibsequent manipulations via site-directed mutagenesis. 
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The manner of insertion into this vector can.be achieved in 
non- expressing orientation. 

The strategy allows for insertion of a synthetic V-min gene fragment 
between the SsnL and BamHl sites of the vector. By including 
additional unique restriction sites on the synthetic fragment which 
flank the gene, high flexibility for subcloning from this source is 
possible. After cloning into pBluescript, the majority of restriction 
sites within the gene remain unique. 



^•^•5 Subclo ning into expression vectors. 

The gene can be cloned into pICI expression vectors (described below) 
on a Seal (blunt- end) to Xbal restriction fragment between the Konl 
(T4 polymerase treated to create a blunt end) and Xbal sites. The 
gene can be subsequently transf ered to vectors with different 
promoters by simple restriction fragment swap such as for example Tm 
vectors . 

The gene design also allows subcloning into any vector which makes use 
of an Ndel site to create the ATQ initiation codon. in this case a 
single N- terminal amino-acid addition (Ser) would be created. 

Cloning into E^coli secretion vectors wotad be useful in that any 
sequence modifications coiUd be readily assessed in a binding assay. 
This would allow rapid generation and screening of several constructs . 
N- terminal amino-acid substitutions would result from cloning into the 
lac promoter /pelB leader vectors used for antibody fragment expression 
vectors (Gussow et al, X989, Cold Spring Harbor Symp . Quant . Biol LIV . 
P265-272) . These substitutions have been modelled and are not 
considered to significantly alter the folding of the molecule. 
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A racv sequence file was con,piled for the v-ndn gene in trp and 
lambdap^ promoter vectors with two different ribosome binding site 
(RBS) sequences. In each case, the progranune FOU> in Wisconsin was 
used to predict mRNA secondary structure over the first XOO bases of 
message. Whilst not entirely predictive, this programme can be 
employed to verify translation initiation problems and can alert the 
researcher to any potential problems with a given sequence. After 
running this programme minor changes were made to the 5' region pf the 
gene to remove a potential stem-loop which reduced the availabilty of 
the AUG initiation codon. These changes did not con^^romise the 
previous design features. The new stem-loop predictions have a low 
free energy and allow good access to the initiation codon. 



2.1.7 



Sgcondarv struf.t--i,^» ronsiri.»^a^ s»T,o 



The gene was divided into the 20 oligonucleotides (Seq.iD No a - 20) 
shown in Figure 20/Table 2 on an arbitrary basis. The Wisconsin 
programme STEMLOOP was then used to identify potential secondary 
structures within the gene, mese secondary structures can cause 
problems in the correct annealing of complementary oligonucleotide 
pairs during gene synthesis. Any significant stem-loop structure 
occuring within a particular oligonucleotide sequence was destabilised 
by base changes which did not compromise the main design 
considerations. 



2.2 



Preparat-ion of m iaonuel>>y |;^,rf«B 



The oligonucleotide sequences shown in Table 2 were prepared using 
known methods such as on an Applied Biosystems 380A ONA synthesiser 
from S'dimethoxytrityl base-protected 

nucleoside-2-cyanoethyl-N.N-di.isopropyl. phosphoramidites and 
protected nucleosides linked to controlled-pore glass supports on a 
0.2 scale, according to protocols supplied by Applied Biosystems 
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Xnc. Alternative apparatus includes Applied. Biosystems 3 SOB or 394 
DNA synthesisers using 5 ' dimethoxytrityl base -protected 

nucleoside-2-cyanoethyl-N,N-di-isopropylphosphoramidites and protected 
nucleosides linked to controlled-pore glass supports on a 0.2 Mmol 
scale, according to protocols supplied by Applied Biosystems Inc. 

Alternatively, the oligonucleotides can be prepared manually 
by the methods described by Atkinson and Smith in -Oligonucleotide 
Synthesis, a manual approach" (M.T.Gait, editor, IRI. Press. Oxford. 
Washington DC, pages 35-81) . 

The purification of the oligonucleotide sequences subsequent to 
automated synthesis was as follows : 

Each synthesised oligonucleotide, after cleavage from the solid 
support and removal of all the protecting groups, was dissolved in 
double distilled water (iml) . An aligout (400^1) of each 
Oligonucleotide was taken and the DNA precipitated by the addition of 
50m1 of 3M sodium acetate, pH6.5 and iral absolute ethanol. After 
incubation on ice or at -20oc for a minimum of 15 minutes, the 
precipitated DNA was pelleted by centrifugation at 13.00olpm for 10 
minutes. The supernatant from each pellet was discarded and the 
pellet washed in 1ml of 70* ethanol (in water) . re-centrifuging for 5 
minutes. Pellets were then dried under vacuum before resuspending in 
20M1 distilled water and 5^1 of a formamide/dye mixture <lOmM NaOH, 
O.SmM EDTA. 0.01% each of bromophenol blue and xylene cyanol and 80% 
formamide) . The san^jlea were then electrophoresed on a 10% 
polyacrylamide gel containing 8.3M urea in lOOmM Tris-borate buffer, 
2mM EDTA (pH8.3) . 
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After electropboresis. the full-length ollgowcleotide product was 
excised from the gel after visualisation by UV-shadowing {Narang e£ 
al, 1979. in Methods in Enzymology. 68, pp90-98) . This was generally 
the major product of the synthesis although products of intermediate- 
size were also visible. The DNA was electro-eluted from the 
acrylaraide in a dialysis bag containing 400,:1 of lOmM Tris-borate 
buffer. O.ZmM EDTA (pH8.3) at 300v for 2 hours. The eluate was 
collected and the DNA recovered by ethanol precipitation as described 
above. Alternatively, recovery of the DKA can be achieved by the use 
of proprietary kits such as Spin-X filters (Costar) with the protocols 
recommended by the manufacturers. The concentration of each purified 
oligonucleotide solution was estimated by scanning a diluted san?,le 
from 300 to 220nm in a uv spetrophotometer and using an extinction 
coiBfflcient of lOD,^„..20Mg/ml. 

2. 3, Gene Assembly 

Oligonucleotides fieq.ID 2-19 (i00-400pm of each) were phosphorylated 
with T4 polynucleotide kinase (50 units, Pharmacia Biosystems Ltd.) in 
a solution containing ATP IB(M) . ATP (SOpM, New England Nuclear, 

SOOOCi/mmol) , spermidine (lOO^iM) . MgCl^ {20mM) , Tris-HCl, pH9.0 (50mM) 
and EDTA (o.lmM) at 37-c for ihr. The effectiveness of the 
phosphorylation reaction was assessed by electrophoresing a 2,tl sample 
of each reaction on a 10% polyacrylamlde gel containing 8.3M urea in 
lOOmM Tris-borate buffer. 2mM BDTA (pH8.3) followed by autoradiography 
of the gel. The reactions containing oligonucleotides seq.lD lo and 
20 were incubated at 70»C for 10 minutes to inactivate the T4 
polynucleotide kinase thus preventing premature phosphorylation of the 
complementary partner t^on mixing. 
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The phosphorylated oligonucleotides were then mixed in complementary 
pairs as shown in table 1 to produce the duplexes A to j. The two s- 
Oligonucleotide sequences (seq.ID 1 and 11) were used unphosphorylated 
{200pm in 2Snl) to prevent concatomerisation of the final or 
intermediate gene products . The mixed oligonucleotide pairs were then 
ethanol precipitated as previously described. Each pair was 
resuspended in IOOmI of distilled water and the complementary 
sequences annealed by heating to loocc for 1 minute before gradually 
cooling to room temperature over several hours or overnight. 

The annealed pairs were then mixed into 3 sets as shown in Table 2. 
The volume of sets II and III was increased to 400^1 by the addition 
of lOOMl of distilled water. Protein contamination was then removed 
from each set by treatment with Strataclean resin (Stratagene) in 
accordance with the manufacturer's instructions. Alternatively, this 
can be achieved by extraction against an equal volume of phenol! The 
DNA was again recovered by ethanol precipitation and the pelleted DNA 
dried under vacuum. 

The DNA in each set was then ligated together by resuspension in 30^1 
of a solution containing T4 DNA ligase ( i unit; Bethesda Research 
Laboratories), SOmM Tris-HCl, pH7.6, XOmM MgCl^, 5% w/v polyethylene 
glycol 8000. in« ATP and imM DTT (BRL, Focus. 8, winter 1986) at is-c 
for a minimum of i.s hours. The ligated DMA from each set was again 
recovered by ethanol precipitation, resuspended in 20/^1 distilled 
water and 5^1 formamide dye mix and each set electrophoresed on a X0% 
polyacrylamide/8.3M urea gel. Bands of the appropriate length 
(163-217 bases) were identified by autoradiography, excised from the 
gel and recovered by electroelution and ethanol precipitation. These 
products were then annealed, the sets pooled, the DKA recovered by 
ethanol precipitation and then ligated to generate the complete gene 
sequence in a repeat of the above processes. 
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Recovery of the intact gene sequence can be ^dxieved by electrophresis 
of the ligation products on a 4% polyacrylamide/S . 3M urea gel. 
autoradiography, excision of the appropriate bands, electroelution and 
ethanol precipitation. The recovered DNA then requires annealing to 
produce the double- stranded gene fragment. Alternatively, a sample 
(IMI) can be added to a polymerase chain reaction (PCR) containing 
oligonucleotide seq.ID Nos. i and ii (o.25mM of each) , dATP. dCTP 
dGTP and dTTP (0.2mM of each). KCl (50mM) , Tris-HCl, pH8.3 (lOmM) ! 
MgCl^ (l.smM), gelatin {0.01% w/v) and Tag polymerase (2.5 units 
Cetus Amplitaq) and incubating at. for example, 94-C for 1.5 minutes 
550C for 1 minute and 72oc for i.s minutes for 25 cycles plus 72-0 for 
5 minutes in a programmable heat controller (Techne PHC-l) . The 
resulting single, amplified product can be purified from excess by 
spun cow chromatography using a Miniprep spun column (Sephacryl 
S-400, Pharmacia Biosystems Ltd.) or other proprietary equipment in 
accordance with the manufacturer's instructions and recovered by 
e thano l precipitation. 

Prior to cloning the isolated gene must be phosphorylated (as 
described for the individual oligonucleotides) to allow, ligation with ' 
a suitably digested cloning vector. 



2.4 



Clonino the Svn^■ he3i3«>t^ Bona 



The synthetic gene was designed with cohesive termini which allow 
insertion into a general cloning vector digested with the restriction 
enzymes and laiBHI such as the phagemid pBluescript (see "Cloning 

Strategy- above and fig 21) or any other suitable phagemid or plasmid 
vector. Cloning into the SesI site does not result in regeneration of 
a restriction site (GCTACA) whereas the BajnHl site is retained. 
Additional restriction sites are Included in the synthetic gene both 
5- and 3' to the actual coding sequence to allow flexibilty in 
sub-cloxxing for expression. 
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A sample of pBluescript KS- (s^g) was digested with the enzymes Ksnl 
(25 units, Bethesda Research Laboratories) and BaniHI {24 units, 
Amersham International) in a 100^1 solution containing lOmM 
Tris-acetafce buffer. pH7.5. 10 m magnesium acetate and SOmM potassium 
acetate (provided in lOx concentration as One-Phor-All buffer PLUS 
Pharmacia Biosystems Limited) . The reaction was incubated at 37oc'for 
1.75 hours, control samples of the phagemid (O-S^g) were digested 
with each enzyme individually in a 10^1 volume under identical 
conditions. After digestion, samples of each restriction digest (o.i 
- O.s^g) are electrpphoresed on a O.B% agarose gel in a solution 
containing 20mM Tris-acetate, pH8.0, O.SmM EHTA at lOOV for 30 minutes 
followed by staining with ethidium bromide (l^g/ml) and visualisation 
under uv Illumination. Digestion with each enzyme in the double 
digest was confirmed by the presence of both major (28S6bp) and minor 
(65bp) bands and by complete linearisation with each enzyme 
individually. 

The double digest was then treated with Strataclean resin (Stratagene) 
to remove proteins using the protocol provided by the supplier and the 
UNA recovered by ethanol precipitation. The purified DMA pellet was 
resuspended in 25^1 of distilled water and 5^1 of a solution 
containing 25% sucrose. 5mM sodium acetate. 0.1% sodium dodecyl 
sulphate and 0.5% bromophenol blue before electrophoresis on a 3% 
Nusieve GTG agarose gel (FMC Bioproducts) in 20mM Tris-acetate. pH8 0 
and O.SmM EDTA at 80V for l hour. The DNA was visualised under UV 
illumination after staining the gel in ethidium bromide solution 
(l^g/ml) . the major vector band excised from the gel and the DNA 
recovered by phenol, phenol/ chloroform and chloroform extractions by 
the procedures provided by the suppliers of the agarose. Alternative 
methods for isolation and purification of DMA fragments are provided 
in "Molecular Cloning - a laboratory manual". Maniatis. Ftitsch and 
Sambrook. published by CSH laboratory, second edition 1989 and . 
hereinafter referred to as "Maniatis- (Chapter 6. p22-35) . After 
recovery, the DMA was resuspended in XO^l of distilled water. 
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Alternatively, isolation of the synthetic ge^ie via PGR results in a 
product which can he cloned into a vector digested with a restriction 
enzyme which creates blunt ends such as Smal digested MXZmeXO 
(Messing, j., i983. Methods in Enzymol.ioi, p20) available from 
Amersham International. M13mpl0 can be purchased (from Amersham 
International) already digested with Smi and de-phosphorylated with 
alkaline phosphatase to prevent re-circularisation. Alternatively any 
suitable vector can be digested with a restriction enzyme generating 
blunt ends and then treated with calf -intestinal or bacterial alkaline 
phosphatase using protocols provided by the supplier of the 
appropriate enzyme. 

Cloning of the isolated synthetic gene was achieved by ligation with a 
sample (50-lOOng) of the and BamHI digested pBluescript KS- 

vector in a solution containing T4 DNA ligase ( i unit; Bethesda 
Research Laboratories), 50«M Tris-HCl, pH7.6. XOmM MgCl^, 5* w/v 
polyethylene glycol 8000, imM ATP and ImM DTT which is Lcubated at 
150C for a minimum of 2 hours. Alternatively the PCR generated 
synthetic gene is ligated with the §mal cut and dephosphorylated 
Ml3mpio under similar conditions. After ligation, the dna can be used 
to transform competent E;.?oli cells such as strain DHSa (a Dlac 
derivative of strain DHl. Hanahan. D., 1983, J.MOl.Biol. 
P557-580) for pBluescript and strain JM109 ( Vanish- Perron et al. 1985. 
G«ie 33. P103-119) for MlBn^lo. The competent cells can be purchased' 
from a number of suppliers such as Bethesda Research Laboratories or 
Stratagene. Alternatively, procedures for the generation and 
transformation of competent cells are provided in Maniatis, Chapter l 
P74-84 and Chapter 4 p33-38. The vectors used can transform E.coli 
cells to ampicillln resistance (lOOMg/ml) or in the case of M13mpl0 to 
generate plaques due to growth inhibition of infected cells. 
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Having selected ampicillin resistant colonie? or plaques, these can be 
propagated and DNA purified for analysis. With both vectors, double 
stranded phagemid or replicative form (rf) dna can be recovered from 
cultured cell pellets. Alternatively single stranded DNA can be 
prepared by rescue of phage particles using helper phage (such as R408 
described by Russell et al, xsb6. Gene ^5. p333.338) from phagemid 
containing cells or directly from the supematants of cells infected 
wxth M13mpl0. standard protocols for the purification of double- and 
Single-stranded DNA are provided in Maniatis (Chapter 1, p2i.s2 and 
Chapter 4. p26-32) . Double-stranded DNA can be used for analysis by 
restriction enzyme digestion to identify clones with inserts of the 
appropriate size. Both types of DNA can be used for DMA sequence 
analysis by the dideoxy chain termination method of Sanger 
(Proc.Nat.Acad.Sci. USA 74. 1977, p5463) using a proprietary 
sequencing kit such as the Sequenase kit supplied by united states 
Biochemical Company and used in accordance with the protocols provided 
and analysed by autoradiography after high resolution polyacrylamide 
gel electrophoresis (Sanger and Coulson, 1978, PEBS lett.fil, pl07) . 

one plasmid clone identified as containing the gene sequence was named 
PMIN603. The construction of this plasmid is outlined in fig 23. 



2.5 



Cpngtruetlon of r rroU e«>T-»B.:. ion v^nt-r.,- ,. 



The V-min gene can be cloned into a number of E.coli expression 
vectors for intracellular accumulation. Such vectors can allow 
constitutive' expression from a ^ promoter (Bennet and Yanof sky. 
1978. J.Mol.Biol.. i2i, P179-192) or regulable expression from a' 
lambda promoter under the control of a temperature sensitive CI857 
repressor gene (Remaut gt al, X9B6, Methods in Enzymology 19, 
P3ff6.383) . suitable alternative promoters can also be employed for 
this purpose. 
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variants of these vectors can be prepared in., which the only difference 
is in the UNA signal for translation initiation site or ribosome 
binding site (RBS) . These variants can be compared for optimal 
accumulation level which can vary with different RBS sequences . 

2.5.1 PICI0Q20 

Plasmid vector picio020 is a pATlS3 based plasmid vector in which the 
CSlbp EcoRl-Acel region is replaced by a I67bp EcoRl-Clai fragment 
consisting of:- 

1) a synthetic E^^ali trp promoter and trp leader ribosome binding 
site 

2) a translation initiation codon 

3) a multiple restriction enzyme recognition sequence derived from 
Ml3mpi8. containing sites for Kpnl, BamHI, xbal. Sail. Pstl, SphI and 
Hindlll. 

4) a synthetic transcription termination sequence. 

The construction of a plasmid vector containing a synthetic trp 
promoter sequence is published (Windass gt al Nuc. Acids Res. lo 
P6639-6657, 1982) . A promoter fragment was isolated from such a 
vector after digestion with the enzymes EcoRl and SesI and 
purification of the appropriate band from an agarose gel by 
electro-elution (in Maaiatis. Chapter 6 p28) . 

A pair of complementary synthetic oligonucleotides were prepared which 
would llgate to the gEal end of the promoter fragment providing the 
natural trg leader ribosome binding site, a translation initiation 
codon and a 3' iffiai cloning site. These oligonuleotides were mixed in 
equimolar concentrations and allowed to anneal by heating to lOO-c 
followed by slowly cooling to room temperature. 
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The promoter fragment and annealed oligonucl.eotides were then ligated 
and the appropriate band isolated from a polyacrylamide gel by 
electroelution. This fragment was then ligated with an M13mpl8 vector 
derivative containing the trg attenuator sequence (generated from 
synthetic oligonucleotides) cloned into the HindiiT site and 
introducing an additional Clal restriction site 3' to the attenuator. 
The ligated DMA was trahsfected into E.eoll strain JM109 
(Yanisch-Perron et al Gene, ai, pi03. 1985) made competent by the 
CaCl^ method (Maniatis, chapter lp82) . After plating out and 
incubation of the plates, plaques were screened by the method of 
Benton and Daviea (Maniatis. chapter 4p4l) using a labelled probe 
generated by .nicJc translation of the IsaRI-HEal promoter fragment 
isolated previously. Single stranded DNA was prepared from positively 
hybridising plaques by a standard method (Maniatis, chapter 4p29) and 
sequenced using the m3 universal primer and the Sanger dideoxy chain 
termination method as provided in kit form by a number of suppliers 
eg. Sequenase (United States Bioscience) . 

RP DNA was prepared from one isolate in which the promoter/ribosome 
binding site/attenuator sequence had been confirmed. This DNA was 
digested with isaRI and £iai and the appropriate fragment isolated 
from a polyacrylamide gel as above. Plasmid pAT153 (Twigg and 
Sherratt. isao. Nature Ml. p2X6-2xa) was digested with the enzymes 
EC2RI and AccI and ligated with the isolated promoter fragment. 
Ligated -DNA was transformed into competent E.coll HBIOI (Bethesda 
Research Laboratories) and amplcillin resistant colonies selected. 

Plasmid DNA from several clones was prepared and DNA sequence derived 
from the region between the Isogl and Clal sites. One clone confirmed 
as containing the correct promoter/attenuator region was named 
PICI0020. This construction is outlined in fig. 24. 
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^•^•^ Pr-eparat:ion of Pl asmid p rm m 

* 

The plasmid pCHlOl corresponds to pici 0020 except that the 
EcoRI-Sali fragment is replaced by a fragment consisting of the SEQ ID 
NO 37 <see Figure 25) and the interferon gene sequence as described 
by Edge M.D. gt gl. Nucleic Acids Research 1983, Volll, p64l9-6435 
in this regard the 3'.terminal ATG codon of SEQ ID No 37 immediately 
precedes the TGT codon which codes for cysteine (amino acid l) in the 
interferon sequence of the above-mentioned Edge M.D. et al Nucleic 
Acids Research reference. The 5' nucleotide sequence GATCCA.TG and the 
complementary 3' nucleotide sequence GTAC are thus omitted from the 
nucleotide sequence of the aforementioned reference. 

2.5.3 PICI0042 

PICI0042 is a plasmid in which the antibiotic resistance markers of 
PAT153 have been replaced by a single, inducible tetracycline 
resistance gene from the plasmid RP4 (encoded by the gene tetA and 
regulated by the product of the tetR gene) . These genes have been' 
characterised by Klock et al (J.Bacteriol. 161 p326-332, 1985) A 
Plasmid stability function (cer) has also been incorporated. This 
obviates the requirement for S-lactam antibiotics in cultures of cells 
bearing this plasmid. Because the new resistance marker is only 
expressed in the presence of antibiotic, the tetA gene product will 
not be a potential contaminant in cultures where the plasmid is 
stabily maintained in the absence of tetracycline. 

The initial stage in the generation of this vector was to produce a 
derivative of pATlS3 from which the gene encoding tetracycline 
resistance had been completely removed. A complementary pair of 
synthetic oligonucleotides were designed to replace the gSSRl-A^al 
fragment from pAT153 with a short sequence containing several unique 
restriction endonuclease sites for subsequent cloning. 
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PAT153 plasmid DMA was digested with the enzymes EcoRI and Aval and 
the 2.175Kbp plasmid DNA fragment isolated from a 0.7% agarose gel 
using Geneclean (Bio 101, California) in accordance with the 
manufacturers instructions. The l.42SKbp fragment containing the 
tetracycline resistance gene is thus removed. 

The oligonucleotides (fig 27/27A) were phosphorylated using T4 
polynucleotide kinase and equimolar amounts annealed together. A 
sample of th^ annealed oligonucleotides was then ligated with the 
plasmid fragment from" pAT153. Ligated DNA was transformed into 
competent B.coli HBlOl (BRL) and ampicillin resistant colonies 
selected. 

several colonies were picked for small-scale plasmid DNA preparation 
by the method of Bimboim and Doly (as specified in Maniatis, chapter 
lp25) and the desired construction identified by restriction analysis 
with suitable enzymes eg. EcoRI. ^zal and BajBHI. The structure of 3 
isolates identified as having the correct restriction pattern was 
confirmed by DMA sequence analysis using a pBR322 EcoRI site clockwise 
primer (New England Biolabs) . one isolate was named picioois. 

RP4 plasmid DNA was isolated from extant stocks by the method of 
Holmes and Quigley (Maniatis, chapter lp29) . This DNA was cut to 
completion with Bglll and then partially with Xinal (at 25oc for up to 
35min) taking samples at various timepoints until a 2.45 Kbp fragment 
containing the £s£E and £e£a was clearly identifiable, a sample of 
PDC8 DNA (Yanisch-Perron sfe al, 1985. C3ene pX03-ll9 and available 
from Amersham International) was digested to con^jletion with BamHI and 
JSnal. Ligations were performed to insert the tetracycline resistance 
genes into the pUC8. Ligated DNA was transformed into E.coli CSOO 
(Appleyard. R.k. Genetics 32 p440, 1954) made competent by the CaCl 
method (Maniatis, chapter lp82) and tetracycline resistant colonies^ 
selected. Plasmid DNA was prepared from 8 clones (Holmes and Quigley) 
and the presence of the RP4 tetR and A genes confirmed by restriction 
analysis. One of these isolates was named pTB344. 
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The tetracycline resistance genes were then .inserted into pICIOOls 
(described above) by replacement of an EcoRl/PstI fragment from 
PICI0019 with the corresponding fragment from pTB344. This results in 
replacement of the majority of the ampicillin resistance gene in 
PICI0019 With the tetracycline resistance genes. After digestion and 
ligation of the plasmid DMAs, followed by transformation into E.coli 
C600. colonies were selected on the basis of phenotype ie. Tc^ and 
AP . Plasmid DNA was prepared from 4 such clones and digested with a 
combination of enzymes eg. BanSHI/Pstl/Ss£I, SS&Rl/§all, Sma l. 
Slacx/fiall and Aial/Pstl. All 4 clones produced restriction patterns 
consistent with the desired construct. One of these was designated 
PTB351. 

summers and Sherratt <Cell 36 pl097.ii03. 198*) have shown that the 
instability of plasmids derived from ColEI (eg. pAT153) is due to the 
loss of a 283bp sequence, ger, present in the parent plasmid. This 
sequence helps prevent the formation of plasmid oligomers, the latter 
appearing to disrupt plasmid partitioning in some as yet undefined 
way. The see sequence (Summers, D. et al MGG 2£i, p334-338, 1985) was 
kindly provided by Prof . D . Sherratt in the form of a fragment cloned 
into PUCX8 (pKS492> . pKS492 plasmid DKA was digested with BamHI and 
Tagi to release a 289bp £sr-contaiaing fragment. Plasmid pTB35l DNA 
(isolated from the daa" host E^coli GM48 - Arraj, a. a. and Marinus, 
M.G. J.Bact. iSl P562-5S5, 1983) was digested to completion with BamHI 
and £lal and ligated with the digested pKS492 UNA. After 
transformation of ligated DNA into competent E.coli C60O, tetracycline 
resistant colonies were selected. Restriction analysis of plasmid DNA 
from putative clones with the enzymes Aval. JSial and P^il was used to 
confirm the presence of sgr. One isolate with the correct structure 
was named pICI0042. This construction is outlined in figs. 26 and 
27/27A. 



wo 94/12625 



- 50 - 



PCT/GB93/0237S 



2.S.4 



Insertion of an Expres sion Cassette into pICT0n4:f 



An expression cassette consisting of the tre promoter, a ribosome 
binding site and the interferon gene was isolated from plasmid 
PCHIOI (see 2.5.2 above) on an EcoSl to Sphi restriction fragment. 
This was ligated into the vector pICI0042 (see above) similarly cut 
with BcoRl and §EhI. This DMA was used to transform a competent 
culture of g- coli C600 and tetracycline resistant colonies were 
isolated. A few of these were tested by plasmid DNA preparation and 
restriction analysis for the acquisition of the SatI restriction site 
carried on the expression cassette. Clones positive in this respect 
were further tested by restriction mapping to checJc that the expected 
construct was correct. They were also checked for the conferred 
capacity to produce interferon protein as analysed on a 
polyacrylamide-SDS gel stained with Coomassie blue. One such 
confirmed clone was designated pZiBQOS. 



2.5.5 



Insertion of T4 transcrip tion terminator into pTB24a 



The T4 transcription terminator sequence in the form of the Sail to 
Hindi i:^ fragment (see SEQ ID No. 38 and Figure 28) was inserted into 
the multicloning site of an intermediate vector pTB244 (described in 
European Patent Publication No. 237,269) between its Sail and HindiTT 
sites. Clone analysis was used to confirm the structure of this 
construct (pTB244-T4 ter) . From this vector, an Ss£l to Sfiiil fragment 
containing most of the multicloning site and the T4 terminator was 
then isolated. This was inserted into pLBOOS similarly cut with SstI 
and thereby substituting the interferon or^ ^ene but leaving a 

cassette consisting of the tre promoter, multicloning site and T4 
terminator. This construct was confirmed by clone analysis and the 
plasmid designated pLB013 (Figure 29) . 
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2.5,6 Substitution of multim nr.-iT„T 

The multicloning site in pLB013 is not ideal for this vector in 
several respects: the Sail, BamHI and Smal sites are not unique, 
existing elsewhere on the plasmid. This fragment was therefore 
excised by cutting with SstI and Xfeal (both unique) and synthetic 
oligonucleotides with the sequence of SEQ ID Ko. 40:- 

MS.^ lalll Seal 

5' aOCTCgAS^GGTAC CAGATCT CTCGA SAGTaCTT 

GGTAT ACCATGGT CTAG RGAGCTCT PATnnariBTn 5. 
SBB.^ Xho l 

were inserted in its place. Clones were analysed for acquisition of 
the new restriction sites and then confirmed by sequencing, one such 
plasmid was designated pLB0X4. The new cloning sites inserted in this 
way are: Msl, Kanl, Salli. and Ssal with the previous Xbal and 

Sail following them. 



2.5.6.1 



Further modification 



It was discovered that the adjacent Sg^I and gdel sites in pLB014 
could not be cut by both these restriction enzymes either 
simultaneously or sequentially presumably because of their close 
proximity. An additional sequence was therefore inserted between 
them. This was done by cutting pM0l4 with sa££ and ffial and then 
inserting the synthetic oligonucleotide of SEQ ID No. 41:- 



5' AGCrCAGCTGCAGCATATGGTAC 
GTCGACQTCGTATAC 5' 
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Clones were analysed for acquisition of an extra PvuII or PstI site 
and then confirmed by sequencing. One such plasroid was designated 
PLB015 pICIOOSO). This plasmid, unlike pLB014, is efficiently cut 
by SstI and IC^I or Ndel. This is to provide a place to insert a 
variety of ribosome binding site sequences correctly positioned with 
respect to the upstream tr^ promoter and by digesting with and T4 

DNA polymerase treatment to generate a blunt end or by digesting with 
Ndel, designed to provide the ATG start codon of the gene to be 
e3q>ressed. 

2 • 5 . 7 Alternative promoters 

The tSEL promoter in pICIOOBO is excisable on ah EcoRI to Sad 
restriction enzyme fragment and can be replaced by a suitable 
alternative promoter element such as the regulable lambda P /cI857 
element from plasmid pICI1079 (Figure 31) which has been deposited 
under the Budapest Treaty at the National Collections of Industrial 
and Marine Bacteria Limited (NCIMB) , 23 St. Machar Drive, Aberdeen, 
AB2 IRY, Scotland, U.K. (NCIMB No. 40370, date of deposit 19 February 
1991) . A version with lambdaP^ promoter is called pICIl23 (Fig 30) . 

Other E.coli promoter elements available on a compatible restriction 
fragment could also be substituted. 

g^b-cloninq the Svnthetic V -min Gene into an escpreaaioT. 

vector 

The cloned synthetic gene in pMIN603 contains a number of unique 
restriction sites which can be used for sub-cloning the V-min coding 
sequence into an E.coli (or yeast) expression system. The enzyme Sea l 
digests at the 5' end of the gene to generate a blunt-ended fragment 
which starts at the N-terminal amino-acid codon (ACT^thr) . This can 
be fused to an expression plasmid which provides transcription and 
translation signals (promoter and ribosome binding site) and can be 
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blunt-ended immediately after a translation initiation codon 
(ATGsmet) , Examples of such vectors are the pICI es^ression vectors 
described above. Alternatively, the unique Ndel site can be used to 
create a cohesive 5' terminus which can be cloned into an identical 
site of an expression vector such as pICIOOSO described above, in 
this case the translation initiation codon is provided on the 
synthetic gene fragment and ei^ression results in an N- terminal 
extension (AGTsser) on the V-min protein. 

E , gold, expression vectors which allow fusion to a secretory leader 
sequence may also be used such that the expressed V-min protein is 
directed to the E.coli periplasm ( as described by Skerra e£ ai, 1991, 
Bio/Technology 2/ p273-278) . In addition, secreted protein can be 
fused to the C-terminal portion of a bacteriophage coat protein (as 
described by Hoogenbobm et al, 1991, Nuc. Acids Res. 19, p4133-4l37) 
to allow the generation of phage particles which display the v-min 
protein in active form on their surface as an aid to selection. These 
approaches may require the modification of 5' and 3' restriction 
enzyme sites by, for example, PCR as described in the cited examples. 
A review of the ea^ression of antibody fragment molecules in E.coli 
has been published by Pluckthun, 1991, Bio /Technology 9, p545-55X. 

The synthetic gene contains unique restriction sites for SnaBi, Bglii, 
BsaBI and Bamni after the translation termination codon (TAA) to allow 
flexibility for cloning the 3' terminus into a range of expression 
vectors. In addititon, pMIN603 has additional cloning sites for xba l. 
Not I, BstXZ, SacI and Bs[sjEiII 3' to the gene to further ixnprove 
flexibility. 
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Egression vector pICIi23 was digested with SesX (Bethesda Research 
Laboratories) in a solution containing 20mM Tris-HCl, pH7.4, SmM MgCl 
and 50roM KCl at 37«C for 2 hours. The Ksnl digested DNA was then ^ 
treated with T4 DNA polymerase (1 unit/^g DNA Amer sham International ) 
in the above solution containing O.lmM each of dATP, dCTP, dGTP and 
dTTP at 12 oc for 15 minutes. This treatment makes use of the 3' to 5' 
exonuclease activity of the T4 polymerase to remove the 4 base 
3 'extension generated by ReiiI digestion resxilting in a blunt end 
immediately after the translation initiation codon. The reaction is 
then treated with Strataclean resin (Stratagene) to remove protein and 
the DNA recovered by ethanol precipitation. The DNA was then digested 
with Xbal (Bethesda Research Laboratories) in a solution containing 
6mM Tris-HCl, pHS.O, 6roM MgCl^, 150 mM NaCl and ImM dithiothreitol 
(DTT) at 37c^C for 2 hours followed by treatment with Strataclean resin 
to destroy enzyme activity and recovery of the DNA by ethanol 
precipitation. The treated DNA was resuspended in 10^1 of distilled 
water. 

The plasmid pMIN603 was digested with Seal (Pharmacia Biosystems 
Limited) in a solution containing 20mM Tris-acetate, pH7.5, 20mM 
magnesium acetate and lOOmM potassium acetate at 37^0 for 2 hours. 
The reaction was then treated with Stratclean resin and the DNA 
recovered by ethanol precipitation. The purified. Seal digested DNA 
was subsequently digested with Xbal as described for pi CI 12 3 above. 
The protein is removed by Strataclean resin treatment and the DNA 
recovered by ethanol precipitation. The digested DNA was then 
eletrophoresed on a 3% NUSieve GTG agarose gel and a DNA band 
corresponding to the McPC€03 V-min fragment (523bp) isolated from the 
gel as described previously for the isolation of Rpnl/BaniHI digested 
pBluescript KS-. The purified DNA fragment was resuspended in lOfil of 
distilled water. 
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A san5>le of the K^I/T4 polymerase treated/Jg^i digested pICI123 
(approx. song) was ligated with the Scal/Jfeal digested V-min gene 
fragment (SO-ioOng) as described previously under "Cloning the 
Synthesised Gene". Ligated DNA was used to transform competent E.coll 
strain CGSC 6300 to tetracycline resistance. Plasmid DNA was prepared 
from selected tetracyline resistant colonies according to the methods 
in Maniatis previously refered to. Restriction digestion analysis is 
used to identify plasmids which contained a cloned insert consistant 
with the size of the McPC603 V-rain gene. The cloning of the gene was 
then confirmed by DNA sequence analysis. 

One cloned plasmid with the correctly inserted V-min gene sequence is 
named pICIlS72. The construction of this plasmid is outlined in 
fig. 32. 

Alternatively, similar techniques can be used to clone the v-min gene 
into other vectors for intracellular accumulation, secretion to the 
periplasm or for display on the surface of a bacteriophage. 

2.7 Expression stuciiea 

Plasmid PICI1572 is used in expression studies . It is used to 
transform a range of E.gol:^ host strains, some of which are protease 
deficient, to assess accumulation levels in different genetic 
backgrounds . 

cultures {7Sml) are grown in M9 medium (i2.8g/l di-sodium hydrogen 
orthophosphate septahydrate, 3g/l potassium di-hydrogen 
orthophosphate, 0.5g/l sodium chloride, l.Og/1 ammonium chloride) plus 
0.2% (w/v) glucose and 0.02% (w/v) casein' hydrolysate, in the presence 
of tetracycline (IS ftg/ml) . shaking at 37<>C for 7-16 hours. 
Ea^ression is induced by increasing the growth teit5>erature to 42«C for 
2 hours prior to harvesting the cells. The OD^^^ of the culture is 
measured in a spectrophotometer, the cells pelleted by centrifugation 
at lOOOx g for s minutes and the cell pellet is resuspended in a 
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solution containing 62 . 5mM Tris-HCl, pH6,8, ,1% (w/v) sodium dodecyl 
sulphate, 10% sucrose, SOmM DTT and 0.05% (w/v) bromophenol blue to 
°^540"^° incubated at lOOoc for 15 minutes. A 20^1 aliquot of 

each sample is then used for analysis by sodium dodecyl 
sulphate-polyacrylamide gel electrophoresis (SDS-PAGB) analysis as 
described in Maniatis (Chapter 18, p47-55) . 

Expression of the McPC603 V-min gene is detected by the presence of an 
additional band in cells containing clone pICIl572 compared to cells 
containing the parent vector (plCI123) on Coomassie-blue stained gels. 
Alternatively, the incorporation of ^^S methionine into newly 
synthesised proteins can be used to identify additional proteins 
expressed by pICI1572 clones compared to non- expressing control 
clones. The identity of additional proteins can be confirmed by 
electrophoretic transfer of proteins from SDS-PAGE to a nylon or 
nitrocellulose membrane using a Pharmacia Multiphor electro -blotting 
apparatus in accordance with the manufacturer's instructions followed 
by Western blot analysis xising an anti-c-myc tag antibody (9E10 as 
described below) and detection by standard procedures (Maniatis, 
Chapter 18, p69-75) or Auroprobe BIh- (Amersham International) using 
the protocols provided with these reagents. 

Recombinant V-m± n Bxtraefeion and Iflolation 

Recombinant V-min expressed in E. coli is produced in 
either soluble form (particularly by manipulation of expression 
construct and growrth conditions) or as insoluble inclusion bodies. 
Secretion methodology can also be used (see Example 4) , 
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Purification of soluble forms of the protein. is accomplished for 
example by cell lysis (sonication or high pressure horaogenisation) , 
centrifugation and then separation techniques known in the art in a 
variety of combinations. Known separation techniques include the 
following : 

ion-exchange chromatography; gel permeation chromatography; ammonium 
sxilphate precipitation; acid precipitation and; 

immobilised antigen affinity chromatography (for example, for McPCS03 
Vmin see reference : Glockshuber, R. et al, 1990, Biochemistry 29, 
1352-1367, and references contained therein) . 

Renaturation and purification of insoluble V-min is accomplished 
by cell lysis (sonication or high pressure horaogenisation), 
centrifugation, washing of the pellet fraction with buffer or mild 
chaotropes, solubillsation in chaotropio agents (eg. 6 M guanidine 
HCl, 8 M urea, detergents etc with or without reductant being present) 
followed by the controlled removal of the chaotrppic agent (eg, by 
dilution or dialysis) and formation of the disulphide bonds by 
oxidation reactions (eg. using dissolved air, glutathione redox 
buffers, etc) . The resulting solubilised, renatured V-min is purified 
as above. 

For exan5)le. E. eoli paste, containing McPC603 Vttiln inclusion bodies, 
was resuspended in lysis buffer (is » sucrose, 50 mM 

(ethylenediaminetetraacetic acid, hereafter abbreviated to EDTA, 50 mM 
Tris(hydroxymethyl) aminomethane hydrochloride, pHB.O) to an optical 
density of SO (SSO nm wavelength) . Lysozyme was added to a final 
concentration of l rag/ml and the suspension was left on ice for 30 
minutes. The sample, on ice. was sonicated in 4 x 45 second bursts and 
the resulting lysed cell suspension centrifuged at 25000 g for 30 
minutes. The pellet, containing the Vtain inclusion bodies, was 
resuspended in phosphate buffered saline and recentrifuged at 25000 g 
for 30 minutes. 
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The washed inclusion bodies were dissolved in a urea solution (8 M 
urea/50 mM potassium phosphate/50 mM beta mercaptoethanol , pH 7 4) and 
the resulting solution clarified by centrifugation. The clarified 
solution was dialysed against 50 mM Tris (hydroxymethyl) aminomethane 
hydrochloride. pH lo.s overnight at 4oC. The resulting solution was 
Clarified by centrifugation and the supemate, containing the 
solubilised vtnin, further dialysed against phosphate buffered saline 
and then purified utilising methods outline above. 

2-9 Aetlvltv Aaamva 

General details of activity assays are given in 3.2 below, m the 
assay for McPCG03 V-min with cmyc tag, binding material suitable for 
the MCPC603 antibody is used in place of the Colo205 cells used for 
19.9 v-min. An example of a binding material suitable for McPC603 
V-min assay is a phosphorylcholine-protein conjugate (17) . m this 
case steps 1, 2 and 3 of the 19.9 v-min assay are replaced by:- 
1- lOOMl of a phosphorylcholine-protein conjugate solution 

(typically lo^g/ml but depending on the particular conjugate) in 0.05M 
carbonate/bicarbonate buffer pH9.s is added to each well and incubated 
at ^'C for 16 hours in a damp atmosphere. 



An alteniative assay procedure, provided appropriate tyrosine residues 
are maintained within the vmin structure, utilises the change in 
fluorescence observed on the binding of the phosphocholine hapten to 
the Vmin as described for MCPC603 antibody fragments in reference • 
Glockshuber et al, 1990, Biochemistry 29. 1362-1367 and references 
contained therein. 

to alternative rapid qualitative assay for McPC503 binding to 
phosphocholine is provided by the selective binding of vmin protein to 
immobilised phosphocholine matrices as described in reference : 
GlocJcshuber et al, 1990, Biochemistry 29. 1362-1367. 
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Generation of Vmin aenea uai na Polymerase Cha-tTi Reaction 

(PCR> 

All or part of the Vmin gene sequence may be generated by the use of 
suitable primers and DNA sequence amplification using Polymerase Chain 
Reaction. Two primers are required for each part of the Vmin gene 
generated by PCR, Other regions of the gene can be generated by 
chemical synthesis, as detailed elsewhere in this document, by 
restriction enzyme isolation of suitable gene fragments, or by other 
standard molecular biology techniques, as deemed most appropriate for 
the particular Vmin gene sequence under consideration. The individual 
fragments of the Vtain gene may then be joined together by further 
rounds of pcr reactions, or by otber standard molecular biology 
techniques . 

For example, in a typical Vmin construct consisting of two sequences 
derived from the Vh region of the heavy immunoglobulin chain of a 
specific antibody, joined to a sequence derived from the VI of the 
same antibody, all or any of the three Vmin regions may be derived by 
PCR, The specific sequence of the Vmin having been determined by 
molecular modelling, as detailed elsewhere in this document, and the 
Vh and VI nucleic acid sequence of the original antibody being 
available, in a molecular biology vector suitable for PCR 
amplification. In the case where the whole Vtain is to be generated by 
PCR, then primers are designed for the amplification reactions. For a 
three region V^ construct six primers are required, two for each 
domain. In addition other primers may be required if it is neccessary 
to alter the native DNA sequence of the original antibody in the Vmin 
regions, in order to change specific codons as dictated by the peptide 
sequence of the modelled Vmin, or to change the codon usage to be 
compatible with the final Vmin expression system. Other functions or 
DMA sequence may be added to the Vtain sequence, for exait^le 
restriction sites or a peptide leader or tag, either during the 
construction of the gene or during or by cloning into a succession of 
molecular biology vectors for maintenance, sequencing or expression of 
the cloned nucleic acid sequence. 
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Primer design for the PCR of the Vmin regions is of particular 
importance. Primers are made to be complementary with the DNA 
sequence on either side of junction of the domain regions of the Vmin. 
Primers complementary to the ends of the Vmin may be extended if 
necessary to give featxires to aid the cloning of the Vmin sequence 
into the chosen molecular biology vector. Thus a Vh sequence having 
four DNA domains, A-BC-DE-FG-H, and a VI having three domains, 
I-JK-IiM-N, which give a Vmin consisting of the three domains 
E-FC-DK-L, in that order, where A-B for example is a 5' to 3' DNA 
sequence from the 5' end of the Vh, and A and B are sequences of about 
20 nucleotides from each end of A-B. Thus the three pairs of primers 
to generate the three domains of the Vtnin, with suitable overlaps, are 

^* E (with 5' addition for cloning if required) and the 

conqplement of FC, to generate E-FC by PCR from the Vh, 

2. FC and the con^leraent of DK, to generate FC-DK by PCR from 
the Vh, 

3. . DK and the complement of L (with 5' addition for cloning if 
required) , to generate DK-X, by PCR from VI. 

The three PCR fragments are then purified by standard Molecular 
Biology techniques, and may then be used to generate the Vtnin, or 
intermediates by further rounds of PCR. 

For exan^le :- 

1* E-FC mixed with equimolar FC-DK, and primers E and 

complement of DK, in PGR will generate intermediate E- FC-DK. This 
when purified can be mixed with equimolar DK-L and primers E and 
complement of L, in PCR to generate the complete Vmin, E-FC-DK-L. 
After purification this may be cloned into a sxiitable vector. 
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2. It is possible that the Vim may be generated by mixing E-FC, 

FC-DK and DK-L together in eguimolar amounts, with primers E and 
complement of L in a PCR. 

■ It is also possible that the Vmin may be generated by mixing 

all of the initial components together in a PCR. That is, Vh and VI 
approximately equimolar, with primers E, FC, DK and complements of 
DK and FC, 



It may be necessary to test several combinations of the possible PCR 
reactions and to vary PCR conditions and con^jonent concentrations to 
find the best reaction parameters for any specific Vmin generation and 
set of primers. 

Any of the PCR generated Vmin fragments may be substituted by 
chemically synthesised double stranded DJJA or restriction fragments or 
DNA fragments obtained by other molecular biology techniques, in order 
to generate the specific Vtain by PCR or other suitable molecular 
biology technique. 

EXAMPLE 3 



Design Synthesis and Cloning of a Syn thetic V^min Gene fo^ 
Antibody 19.9 

The amino-acid sequence for a V-min protein modelled for the antibody 
19.9 is shown in fig, 34. We have previously determiiied the complete 
DNA sequence of 19.9 heavy and light chain cDNAs (Tonge, D.W. et al, 
(1993), The Year in Immunology, vol 1, eda. Terhorst, c, Malavasi, F. 
and Albertini, A., published by Karger AG, Basel, Switzerland: 
Proceedings of Biotech RIA 92 - New Generation of Monoclonal 
Antibodies in Diagnosis euad Therapy, Genoa, Italy, J^ril 12-15, 
1992, published by Karger AG (Basel, Switzerland)) and xised this 
information to compile a DNA sequence capable of encoding the 
amino-acid sequence shown. 
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A synthecic v-min gene for antibody 19.9 was .designed according to the 
.principles described for the McPC603 V-mia gene. The cloning regions 
and tag sequence which flank the coding sequence were identical to 
those used for the McPC603 synthetic V-min gene to maintain standard 
cloning strategies. The oligonucleotide sequences (ID No. 21-36) used 
to construct the gene are shown in fig. 33. 

Gene synthesis, cloning steps and expression studies were performed 
according to the protocols stated for the McPC603 V-min gene. 

3«2 Activity assays . 

3 -2.1 V-min detection and assay 

V-min is detected by antibody reaction to a specific epitope, on the 
V-min EST se, or on a peptide tag added to the V-min during the 
cloning procedures. Detection is in the form of immunoassay, by 
Western blot, dot blot. Elisa or other antibody based detection 
system. An estimate of V-min concentration may then be made by 
comparison to standard material detected by the same system. 

For example, V-min is detected in bacterial supernatant by Western 
blot. Following separation of the protein components by polyacrylamide 
gel electrophoresis, and eleotrophoretic transfer to a suitable 
membrane (for example nitro-cellulose) , the presence of V-min is 
detected by first blocking further non-specific binding to the 
membrane with a suitable protein solution (for exataplB casein) , and 
then reaction with a V-min specific antibody (for example a monoclonal 
antibody specific for a peptide tag on the V-min) . The presence and 
location of the V-mia is identified by a label on the V-min specific 
antibody (for example an enzyme reaction with a suitable chromophore) 
or by a sandwich detection system eir?>loying a labelled antibody or 
other binding molecule reactive with the first V-min specific 
antibody. 
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A similar competition system is used to estimate V-min concentration, 
v-min is bound to miorotitre plates, excess binding sites blocked with 
a suitable protein, and the binding of a V-min specific antibody 
(either epitope or tag reactive) to the bound v-min con^eted by the 
unknown concentration v-min solution (for exan^le bacterial extract) , 
at suitable dilutions. Standard solutions of v-min are also used to' 
generate assay curves. A reduction in antibody binding is detected by 
a label on the V-min specific antibody (for example an enzyme reaction 
with a suitable chromophore) or by a sandwich detection system using a 
labelled, antibody or other binding molecule reactive with the first 
V-min specific antibody. Examples of the methods may be found in the 
laboratory manuals of Sambrook et al (14) or Harlow and Lane (15) . 



^•^'^ I^entifleaticm and „«fi^,.^ioa y.^in h^n,^^„, 



to 



The binding of V-min to the antigen of the antibody from which the 
V-min CDR sequences were derived, or binding to a suitable derivative 
or substitute for that antigen (for example a peptide sequence 
identified from a protein antigen, or a glyco-llpid identified from a 
cellular antigen, or a mimotope) is determined by immunoassay (14, 15) 
or direct physical methods (for< example using the Pharmacia Biacore 
apparatus or McPC603 vmin fluorescence) . 

For example X9.9 V-min with a c-myc C- terminal peptide tag sequence 
(EQKIJSEEDL) is estimated in its binding to antigen in the form of the 
colo-rectal carcinoma cell line, Colo20S (ATCC CCL 222) , by reaction 
with the monoclonal antibody 9E10 (16). In a typical assay :- 
1- 10 CO1O205 cells (grown in serum free medium) in loo^tl of 

PBS are added to each well of a microtitre plate and left at room 
ten5)erature for lo minutes. 

2. The plates are then centrifuged at about 400g for 5 minutes 
at looc. and left to equilibrate at room tenperature for 30 minutes. 

3. 100^1 of 0.1* glutaraldehyde is added to each well to fix 
the cells, and the plates left at room temperature for 10 minutes. 
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The plates are washed 3 times with PBS, leaving the wells 
full for five minutes for each wash. The fi^al wash is emptied and 
the plates banged dry on paper towelling. 

5. Excess binding sites in each well is blocked by the addition 

of 2 00^1 of a 1% BSA solution in PBS and incubation for one hour at 
room ternperature . 

^* The plates are washed 3 times with PBS, leaving the wells 

full for five minutes for each wash. The final wash is emptied and 
the plates banged dry on paper towelling. 

' XOO|il of 19,9 V-min solution, of unknown concentration or 

standards, at a suitable range of dilutions in PBS is added to the 
wells and incubated in a damp atmosphere at 4*»C for typically 16 
hours . 

5' The plates are washed 3 times with PBS, leaving the wells 

full for five minutes for each wash. The final wash is emptied and 
the plates banged dry on paper towelling. 

lOOAii of 9E10 hybridoma supernatant at a suitable dilution 
in PBS, typically 1/30, is added to each well and inctibated for 2 
hours at 40c. 

10. The plates are washed 3 times with PBS to which 0,05% Tween 

20 may be added, leaving the wells full for five minutes for each 
wash. The final wash is emptied and the plates banged dry on paper 
towelling. 

13.. lOOftl of an anti-mouse IgG labelled with horse-radish 

peroxidase, at a suitable dilution (for example Sigma Chemicals Co. 
product A9044 at a dilution of 1/2000) is added to each well and 
incubated at 4<»C for 2 hours. This second antibody is tested for its 
non- reactivity with V-min. 

12. The plates are washed 3 times with PBS to which 0.05% Tween 

20 may be added, leaving the wells full for five minutes for each 
wash. The final wash is enptied and the plates banged dry on paper 
towelling. 
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13. To generate a suitable chromophor e , the contents of one 
capsule of phosphate-citrate buffer with sodium perborate (Sigma 
P4922) are dissolved in lOOml of water according to the manufacturers 
instructions and 30mg of o-phenylenediamine dihydrochloride (OPD) 
added and dissolved, 100/xl of this solution are added to each well, 
and incubated in the dark for up to 30 minutes, until the colour 
develops. 

14. 50/il of 25% siaphuric acid is added to each well to stop the 
reaction, and the absorbance of each well is read at 490nin, 

15. Standard curves are then prepared, and the binding • 
concentrations of unJcnowns is estimated. 

Similar methods may be found in the laboratory manuals of Sambrook et 
al (14) or Harlow and Lane (15) . 

gXAMPLB 4 . Generati on of Dl,3 V-min (conservative) using pcr 

A vmin peptide sequence for the anti-lysozyme antibody Dl.3 (Amit, 
A.G., Mariuzza, R.A., Phillips, S.E.V, & Poljak R.J. Science 233, 
747-754 (1986)), was generated by molecular modelling, based upon 
known DNA and peptide sequence generated from the plasmid 
pSWl-VHD1.3-VKDl.3-rAGl (Ward, S.E., Gussow, D., Griffiths A. D, , 
Jones, P.T- & Winter G. Natmre 341, 544-546 (1989)) by standard 
molecular biology DNA sequencing techniques. The peptide sequence of 
the Vmin is shown in sequence 45, and the vmin consists of three 
domains from the Dl.3 antibody, amino acids 65 to 115 from the Dl,3 
heavy chain (1 to 51 in sequence 45) , amino acids 13 to 64 from the 
Dl.3 heavy chain (52 to 103 in sequence 45) and amino acids 1 to 108 
from the Dl.3 light chain (104 to 211 in sequence 45) . 
The sequence of the Dl.3 insert in pSWl-VHD1.3-VKDl,3-TAGl as 
determined from the start of the first pels through to approximately 
50 nucleotide bases after the TAGl sequence is shown in Figure 35. It 
can be seen that the sequence after the TAGl differs slightly from the 
published sequence (Ward, S.E., Gussow, D., Griffiths A.D. , Jones, 
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P.T. & Winter G. Nature 341, 544-546 (1989) ) .between the EcoRl 
restriction site and end of the TAGl. It is the sequence as shown in 
Figure 35 that was used for the choice and sequence of the PCR 
primers . The specific choice of primers for Vmin gene construction 
will depend on the particular sequences to be included in the final 
construct. 

The sequences of the chosen primers are shown in sequences 46 to 51. 
All primers are chosen to have about 20 bases con^lementary to the DNA 
strand on which they are priming at their 3' end, and if possible to 
end in a GC rich sequence. The single stranded DNA primers are used 
in pairs in three PCR reactions with pSWl-VHD1.3-VKD1.3-TAGl double 
stranded DMA : - 

1. Primer Sequence 46 with primer sequence 49. 

Sequence 46 primer bases 22 to 39 are the same as bases 259 to 276 in 
Figure 35, except that the codon for Arg (262-264) is changied from AGA 
to CGT because of E.coli preferred codon usage (Andersson, S.G.E. & 
Kurland C.G. Microbological Reviews 54, 198-210 (1990)). Bases 1 to 
21 of sequence 46 code for the seven amino acids Asp Tyr Pro Gly Ser 
Ser Gly and have a Smal restriction site (GGGCCC) at bases 7 to 12- 
The Smal site was included to allow blunt end cloning of the Vmin into 
an E.coli plasmid secretion vector. Additional codons were added and 
modelled at the start of the Vtnin gene^ because of the Arg at amino 
acid 2 in the Vmin sequence 45. It is known that charged amino acids 
at the start of a gene to be secreted into E.coli periplasm can block 
generation of the protein product (Geller B,, Zhu H-Y. , Cheng S., Kuhn 
A. & Dalbey R.E. Joximal of Biological Chemistry, 268, 9442-9447 
(1993)) , and the Gly Ser Ser Gly leader was designed to overcome this. 
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Sequence primer 49 bases 25 to 45 are comple^ientary to bases 391 to 
411 in Figure 35, except that the codon for Val (400-402) is change to 
Leu to fit with the molecular model sequence of D1.3 Vmin (sequence 
45) . Bases 1 to 24 are coirplementary to bases 103 to 126 in Figure 



35. 



2. 



Primer sequence 47 with primer sequence 50. 



sequence 47 primer bases 22 to 45 are the same as bases 103 to 12S in 
Figure 35. Primer bases 1 to 21 are the same as bases 391 to 4li in 
Figure 35, except that the codon for Val (400-402) is change to Leu to 
fit with the molecular model sequence df Dl.3 vmin (sequence 45) . 

Sequence 50 primer bases 24 to 46 are complementary to bases 236 to 
258 in Figure 35. Primer bases l to 23 are con5)lementary to bases 538 
to 5S0 in Figure 35 except that the codons for Glu and Leu (544-449) 
are changed to Gin and Met to fit with the molecular modelling of Dl.3 
Vmin (Sequence 45) . 



3. 



Primer sequence 48 with primer sequence 51. 



Sequence 48 primer bases 24 to 46 are the same as bases 538 to 560 in 
Figure 35 except that the codons for Glu and Leu (544-449) are changed 
to Gin and Met to fit with the molecular modelling of Dl.3 Vmin 
(Sequence 45) . Primer bases i to 23 are the same as bases 236 to 258 
in Figure 35. 

Sequence 51 primer bases l to 24 are con^ilementary to bases 922 to 945 
in Figure 35. 
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The PGR reactions were performed using a thermocycler of 25 cycles of 
94^C for 1.5 minutes, 55°C for 1.5 minutes and 72°C for 2 minutes 
followed by a single period of 9.9 minutes at 72°C. Reactions were 20 
microlitre total volume containing a concentration of i microMolar of 
the primers, O.lng of the pSWl-VHDl . 3-VKD1.3-TAG1 double stranded 
plasmid DNA, dNTPs each at a concetration of 200 microMolar and 
amplification buffer and Taq DNA polymerase as recommended in 
'Molecular Cloning - a laboratory manual, Sambrook, J., Fritsch, E.F. 
& Maniatis T. Piablished by Cold Spring Harbor Laboratory Press, second 
Edition 1989' or the material supplier's data sheets. PCR conditions 
may need to be optimised as recommended in the book. 

The structure of the D1.3 Vmin, the Dl-3 insert in 

pSWl-VHD1.3-VKD1.3-TAGl double stranded DMA, and the primers for these 
PCR reactions are summarised diagramatically in Figure 36. The 
products of the PCR reactions are summarised diagramatically in Figure 
37. All numbering in these diagrams is based upon the amino-acid 
sequence of the D1.3 antibody. 

The major product of each PCR reaction, with the correct molecular 
size, (approximately 200 base pairs for PCRs 1 and 2 and 400 for PCR3) 
was then band purified after ruaming on a 1% agarose gel using 
steuidard molecular biology techniques. The isolated DNA from each 
band was dissolved in water to a final concentration of 
Ing/microlitre • 

The next stage in the generation of the vmin is the joining of the PCR 
products from the first round of PCRs. This could be attempted in a 
number of ways, as described above, but in this case it was found that 
a two stage PCR contai n i n g all three products of the first ro\md of 
PCRs gave a satisfactory yield of the Vmin gene product . 
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The first part of the PCR consisted of 10 cyciee in a thermocycler of 
94°C for 1 minute and 63°c for 4 minutes, with a hot start (reaction 
mixture heated to 94°C prior to the addition of the Tag polymerase) . 
The reaction mix contained each of the Vmin gene fragments isolated 
from the first round of PCRs at a concentration of 4ng per 80 
micolitres reaction volume together with dNTPs each at a concetration 
of 200 microMolar and amplification buffer and Tag DMA polymerase as 
recommended in 'Molecular Cloning - a laboratory manual, Sambrook, j. , 
Pritsch, E.P. & Maniatis T. Published by Cold Spring Harbor Laborator^ 
Press, second Edition 1989' or the material supplier's data sheets. 
PCR conditions may need to be optimised as recommended in the book. 
This first part of the reactions should generate some of the complete 
Vmin sequence because of the complementary overlaps at the ends of the 
vmin fragments from the first PCRs. 

The second part of the PCR consisted of 25 cycles of 94°C for i.s 
minutes followed by 5S°C for 2 minutes and 72°C for 2 minutes with a 
final hold of 72°C for 9.9 minutes. Prior to ' these cycles the 
reaction volume was made up to 100 microlitres by the addition of the 
two end primers, seqenoes 46 and 5X. to a concentration of ImicroMolar 
and fresh dNTPs each to a concetration of 200 microMolar and 
amplification buffer and fresh Taq DMA polymerase as recommended in 
'Molecular Cloning - a laboratory manual, Sambrook, j. , Pritsch. E.F. 
& Maniatis T. Published by Cold Spring Harbor Laboratory Press, second 
Edition 1989' or the material supplier's data sheets. PCR conditions 
may need to be optimised as recommended in the book. 
The major product of the two part PCR, with the correct molecular 
size, (approximately 800 base pairs) was then band purified after 
running on a l* agarose gel using standard molecular biology 
techniques. The isolated Vtain DHA from the band was dissolved in 
water to a final concentration of Ing/microlitre . 
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4.1 Cloning of the PCR generated Vmln aeiiQ. 

The Vmin gene product from the PCRs was the restriction enzyme cut to 
render it compatible with the plasmid expression vector. The Vmin 
gene was cut with enzymes Smal and EcoRI, following standard molecular 
Ipiology techniques , the suppliers recommendations , and the methods 
detailed in 'Molecular Cloning - a laboratory manual, Sambrook, J., 
Fritsch, E,F. & Maniatis T. Published by Cold Spring Harbor I*aboratory 
Press, second Edition 1989' . The gene product was cleaned of small 
restriction fragments by standard techniques (for example Geneclean 
II, from Bio 101 Inc., PO Box 2284, La Jolla, California 92038, USA). 

The E.coli plasmid vector selected for periplasmic secretion 
expression of the vmin gene was pICI266. This vector (pICI266) with 
a single pelB leader sequence has been deposited under the Budapest 
Treaty at the National Collections of Industrial and Marine Bacteria 
Limited (NCIMB) , 23 St. Machar Drive, Aberdeen, AB2 IRY, Scotland, 
U.K. The date of deposit was Uth September 1993 and the accession 
number is NCIMB 40589. A plasmid map of pICI266 is shown in Figure 
38. . 

Plasmid DNA was generated by standard molecular biology techniques, 
using methods detailed in 'Molecular Cloning - a laboratory mantxal, 
Sambrook, J., Fritsch, E,F. & Maniatis T. Published by Cold Spring 
Harbor Laboratory Press, second Edition 1989' . Plasmid DNA was enzyme 
digested with restriction enzyme KpnX, using the manuf actxirer' s 
recommended conditions. The cut plasmid DNA was then treated with 
Bacteriophage T4 DNA polymerase in the presence of high concentration 
of dNTPs (2milliMolar) at 12°C (details in 'Molecular Cloning - a 
laboratory manual, Sambrook. a., Fritsch, E.P, & Maniatis T. Published 
by Cold Spring Harbor Laboratory Press, second Edition 1989') to 
remove the protruding 3' termini generated by I^nl, and malcing 
compatible with the biimt end of the Smal cut Vmin. The cut plasmid 
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product was cleaned by standard techniques tfor example Geneclean li, 
from Bio 101 Inc., PO Box 2284, La aolla, California 9203B, USA) and 
then enzyme digested with restriction enzyme EcoRI, xxsing the 
manufacturer's recommended conditions. The cut plasmid product was 
cleaned of small restriction fragments by standard techniques (for 
example Geneclean II, from Bio 101 Inc., PO Box 2284, La Jolla, 
California 92038, USA). 

The Vmin gene (treated with Smal and EcoRI) and the pICI226 (treated 
with Kpnl, T4 polymerase and EcoRI) were then ligated together using 
T4 DNA ligase under standard moleculcur biology conditions (details in 
/Molecular Cloning - a laboratory manual, Sarabrook, J., Fritsch, E.F, 
& Maniatis T. Published by Cold Spring Harbor Laboratory Press, second 
Edition 1989' ) . 

The ligation mix was then transformed into a suitable Ara' E.coli 
strain such as that known locally as MSD213 (strain MClOOO of 
Casadaban, Jounaal of MoleciUar Biology 138, 179ff (1980)), (f" ara 
delta (ara-leu) delta lacX74 galV galK strR) using calcium chloride 
treated cells as detailed in 'Molecular Cloning - a Iciboratory manual, 
Sambrook, J., Fritsch, E.F. & Maniatis T. Published by Cold Spring 
Harbor Laboratory Press, second Edition 1989'* Colonies were selected 
on L agar plates containing lOmicrpgrams/ml of tetracycline. 

f-2 Identification of transf ormants expressing active Dl. 3 Vmin. 

Individual colonies from the transformation were picked into numbered 
1.5ml Eppendorf tubes containing sterile 200microlitres of L broth 
plus 0.1% arabinose plus lOmicrograms/ml tetracycline. The ttibes were 
well sealed and shaken upside down at approximately 200rpm at 30^c for 
about 16 hours. Cells in the tubes were precipitated by cent ifugat ion 
for 5 mdlnutes in a bench top microfuge at room temperature. The 
supernatant was used immediately in ELISAs, and the cell pellet kept 
at 4°C until positive clones had been identified and could be picked 
and streaked on to fresh L agar with tetracycline plates. 
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For the ELISA. polystyrene microtitre plates., were prepared by the 
addition of lOOmicrolitres of a 3mg/na solution of Chicken Egg White 
lysozyme (SIGMA L6876) in 0 . OSM carbonate/bicarbonate buffer pHS.G 
(Sicam. C3141) to each well. The plates were incubated at 4°c for 
about two hours. The wells were washed three times, with a 5 minute 
soak between, in excess phosphate buffered saline (PBS) (SIGMA P4417) . 
The plates were then banged dry onto tissues, and ISOraicrolitres of a 
solution of a l% Bovine serum albumin (SIGMA A7888 for exanjjle) 
solution in PBS added to each well. The plates were incubated at 4°c 
for about 2 hours. The walls were washed three times, with a 5 minute 
soak between, in excess phosphate buffered saline (PBS) (SIGMA P4417) . 
The plates were then banged dry onto tissues, and lOOmicrolitres of 
the bacerial supematants added to the wells, keeping careful note of 
numbers and positions so that individual clones can be identified. 
The plates were incubated at 4?c for about 16 hours . The wells were 
wkshed three times, with a S minute soak between, in excess phosphate 
buffered saline (PBS) (SICTIA P4417) . The plates were then banged dry 
onto tissues, and lOOmicrolitres of 9E10 hybridoma supernatant (Evan, 
G.I., Lewis O.K., Ramsay G. & Bishop J.M. Mol. Cell Biol., 5, 129-X36 
(1985)) at a suitable dilution (typically about 1 in 30) added to each 
well. The plates were incubated at 4*'c for about 2 hours. The wells 
were washed three times, with a S minute soak between, in excess 
phosphate buffered saline (PBS) (Sia4A P4417) containing 0.05% Tween 
20 (SIGMA P1379) . The plates were then banged dry onto tissues, and 
lOOmicrolitres of a suitable dilution of an antl -mouse igG labelled 
With horse-radish peroxidase (for example SIOIA A0944 at a dilution of 
1 in 2000) added to each well. The plates were incubated at 4°C for 
about 2 hours. The wells were washed three times, with a 5 minute 
soak between, in excess phosphate buffered saline (PBS) {SIGMA P4417) 
containing 0.05% Tween 20 (Sicaci P1379) . The plates were then banged 
dry onto tissues, and lOOmicrolitres of a solution of 30mg 
o-phenylenediamine dihydrochloride (OPD) dissolved in 

phosphate-citrate buffer with soditim perborate (SIGMA P4922) added to 
each well. The plates are then incubated in the dark at room 
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tetnperatiire iintil the colour develops, SOmicrolitres of a solution of 
25% sulphuric acid is added to each well to stop colour development. 
The absorbance of each well is read at 490nra. Positive wells are 
identified as having colour development more than twice background. 

The results of an assay are shown in Figure 48 as raw data from the 
plate reader. On this plate rows A and B were duplicate doubling 
dilutions of a positive Dl .3-Fab-cmyc construct, which was also 
included as sample numbers 3 and 34 (positions 03 p3 on plate) . 

Buffer blanks are included at positions CI, BIO, Fl and HlO, 
Bacterial controls are included at positions Ell, E12, Hll and H12 . 
Picked colonics, numbered 1 to 64 (including positive controls 3 and 
34) are placed in the other wells starting at C2 and going from left 
to right and down the plate to H9. Thus the two positive identified 
clones from this plate, F6 (OD 0.540) and G4 (OD 0,398), correspond to 
colonies 37 and 47. The purified sequenced plasmid from 37 was later 
designated paFH3037 (see below) , 

A n\imber of positive wells were identified and the clones which led to 
them were grown and used to generate pleismid DNA by standard molecular 
biology methods (details in 'Molecular Cloning - a laboratory manual, 
Sambrook, J,, Fritsch, E.F* & Maniatis T. Published by Cold Spring 
Harbor Laboratory Press, second Edition 1989') . The purified DNA was 
sequenced around the DX.3 Vtain insert, and a nxamber of them were found 
to have the Vmin sequence shown in Sequence 53. One of these plasmids 
was designated pJFH3037, and a plasmid map is shown in figure 39. 

4-3 Generation of E. coli cell pastes containing D1.3 V-mln 

E-coli strain MSD 213 was transformed with plasmid pJFH3037. The 
resultant strain MSD 215 (p»XFH3037) was pxirified and maintained in 
glycerol stocks at -eOoC. Aliquots of each culture were removed from 
stock and streaked onto agar plates of L- tetracycline to give separate 
single colonies after overnight growth at 37oC. A single colony of 
each culture was removed and suspended separately in lOral 
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L- tetracycline broth and lOOul immediately inoculated into each of 
fifteen 250 ml Erlenmeyer flasks containing 75 ml of L- tetracycline 
broth supplemented with 0.1% w.v L-arabinoae. After growth for I6h at 
30oC on a reciprocating shaker the bacteria were harvested in IL 
centrifuge bottles in a Sorvall RC-3B centrifuge (7000 x g, 4oC, 30 
min) . 

^ - 4 Purification a nd sequence identification of V-min Dl,.3 

(conversative) with c-mvc tacr 

Pellets (frozen at -80°; from 4,3) of E.coli cell paste were thawed, 
resuspended -in lysis buffer (SOnW Tris; SOtriM EDTA; 15% Sucrose; ImM 
Benzamidine, pHB.O) at 4® and 800^1 of 2.5% sodiuan dodecyl sulphate 
(SDS) solution was added for every 25 ml of the suspension. The 
mixture was left on ice for 30 min with occasional agitation, 
deoxyribonuclease added, mixed and the suspension was left on ice for 
a further 30 min with occasional agitation. The cell suspension was 
divided into 45 ml aliquots which were sonicated on ice for 12 x 3 0s 
bursts with 30s intervals between bursts. Sonicated suspensions were 
centrifuged at 13,000 rpm for 40 min at 4° and the supernatant 
collected. Phenyl methylsulphonyl Fluoride (PMSF) was added to the 
supernatant pool to give a ImM concentration followed by dialysis 
overnight at 4° into phosphate buffered saline (PBS), pH 7.3. The 
dialysed sxipematant was filtered through a 0,22Mm membrane, loaded on 
to an immobilised lysozyme column (CNBr- activated Sepharbse 4B TM 
matrix based) equilibrated at 4° in PBS, pH 7.3. The column was 
washed with PBS and bound material was eluted by SOmM diethylamine , pH 
9,5. The successful binding of the D,3 V-min to its immobilised 
antigen confirms maintenance of appropriate GDR conformations found in 
the parent Dl,3 cuitibody. Eluted peak fractions were pooled, 
tri(hydroxymethyl)aminome thane (TRIS) added to give a lOOmM 
concentration and pH adjusted to around pH 7.5 and sterile filtered. 
Further purification, utilising known methods such as ion exchange or 
gel permeation chromatography may be performed if required. 
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A sample of the material was analysed by SDSrPAGE using a 12% 
TRIS-Glycine gel which was electro-blotted on to a nitrocellulose 
membrane followed by Western blot analysis using ah anti- c-myc tag 
antibody (9E10) . The relevant band on a Coomassie stained 
nitrocelliilose blot was sequenced by an automated protein/peptide 
sequencer using the Edman degradation technique, A positive match 
with the expected N-terminal sequence of D-Y-P-G-S-S-G-S-R-L was 
obtained (D-y-p-G-?-?-G-?-R-L-) thus confirming identity between the 
isolated protein and gene sequences. 

^^^^^^^ 5 Intra-cellular E. coli expression of Dl,3 Vbin (conservative) 

In a similar fashion to the generation of the secreted D1.3 Vtnin gene 
detailed in Example 4, a Vtain gene was made by PGR for cloning into an 
inta-cellular E.coli e3q>ression vector. In this case the PCR primer 
shown in sequence 52 was tised in place of the primer sequence 46, in 
the first round PCR reaction with primer sequence 49, and in the 
joining of the three first round PCR products together with primer 
sequence 51 in the second ro\md PCR. The Vrain gene product then has 
the 5' sequence of primer sec[uence 52 at the start of the Vmin gene, 
and this provides an Hpal restriction site (GTTAAC) and sequence 
comprising RWWWWWWWW (where R is A or G, and W is A or T) preceding an 
ATG (methionine) start codon and the Vmin gene. The RWWWWWWWW 
generates a number of sequences that can act as ribosome binding 
sites, and effective constructs, which express the gene from a 
promoter in a suitable vector, can be selected by looking at 
expression levels. 

This D1.3 Vmin gene for intra -cellular expression was restriction 
enzyme cut with Hpal and BamHI. An expression vector such as pICIOOSO 
(described in section 2.5.6.X) is used for cloning of the Vmin. in 
this case the pXCiOOBO plasmid DNA is restriction cut with enzyme 
K^jnl, and then treated with T4 DNA polymerase to remove the 3' 
protruding termini, in a similar manner to that described for plasmid 
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PICI266 (section 4.1). After cleaning the Kpnl cut/T4 DNA polymerase 
treated pICIOOBO is restriction digest cut with enzyme Bglll and the 
plasraid DNA cleaned as described. Restriction enzymes Bglli and BamHI 
generate identical cohesive ends, and the D1.3Vmin gene (BamHI and 
Hpal cut) is then ligated with pICIOOSO (I^I/T4 DNA polymerase and 
Bglll treated) , as described. This inserts the Vtnin gene with a 
variety of potential ribosome binding sites under the control of the 
pTrp promoter of the pIClOOSO vector. 

The ligation mix is used to transform a range of E.coli host strains, 
some of which are protease deficient, to assess accumulation in 
different genetic backgroxinds . The methods for eaipression studies are 
essentially the same as those presented in section 2.7, except that 
the 42 C induction is not required in the pTrp promoter system of 
pICIOOSO. Protein accumulation may be affected by the temperature of 
the growth of the culture, and so a range of temperatures are tested 
for each isolate. Isolates found to give positive bands in the 
Coomassie-blue stained protein gels and/or by Western, detection of the 
c-myc TAG sequence using antibody 9E10, are then used to generate 
plasmid DNA. This is sequenced around the Vtain gene to check and 
confirm the correct nucleic acid sequence of the gene insert, and to 
identify the specif c ribosome binding site sequence, in one example 
AAAATATTT (from the RWWWWWWW possibilities) . Vmin produced by the 
intra-cellular expression system may then be extracted, isolated and 
renatiired as described in section 2.8, and the activity tested in 
systems as described in section 4.2 for an ELISA binding to lysozyme. 



^^CAMPIJC g Vtnin Constructs for Di.3 (ifdl) 
Anti-Iiysozyme antibody 



Fischmann T.O., Bentley G.A. , Bhat T.N. , Bo\aot G. , Mariuzza R.A. , 

Phillips S.E.V., Tello D., Poljak R.J. 

Journal of Biological Chemistry 1991 266 12915. 
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^•1 a) h65-ll5/hX3-64/ll-108 Intact. Light Chain 

Utilising methods set out hereinbefore the V-min construct in Figure 
41 was produced. 



^•2 b) I61-107/ll3.-e0/hl-116 Intact Heavy Chain Dl,3 

Utilising methods set out hereinbefore the V-min construct in Figxire 

42 was produced. 

6.3 c) h87-115/hl3-64/ll-7/l21-61/176-10B Minimal Construct 

D1.3 

Utilising methods set out hereinbefore the V-min construct in Figure 

43 was produced. 

gXftMPIiB 7 Vtnin Constructs for KQL (2fb4) 
Antibody from Human Myeloma patient KOL 

Kratzin H.D., Palm W, , Stangel M. , Schmidt W.E., Friedrich J., 
Hilschman N. 

Biological Chemistry Hoope-Seyler 1989 370 263. 

Marquart M. , Deisenhofer J. , Huber R. , Palm W. 
Journal Of Molecular Biology 1980 141.369. 

7.1 a) h67-i25/hl3-66/li-ii2 Intact Light Chain KOL 

Utilising methods set out hereinbefore the V-min construct in Figure 

44 was produced. 
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7.2 



b) l62"X09/112-61/hl-12S 



Intact Heavy Chain 



KOL 



Utilising methods set out hereinbefore the V-min construct in Figure 



45 was produced. 



B^CAMPIiB a Vmin Conatnicts for pac-1 
Antibody directed against Integrin Ilbllla 

Tomiyama Y, , Brojer E., Rugger Z.M,, Shattil Smiltneck J., 

Gorski J, , 

Kumar A., Kieber- Emmons T,, Kimicki T.J. 

Journal of Biological Chemistry 1992 267 18085-18092. 



8.1 



a) h66-118/hl3-.65/ll-112 



Intact Light Chain 



PAC-1 



Utilising methods set out hereinbefore the V-rain construct in Figure 
46 was produced. 



8.2 



b) 166-lll/112-65/hl-123 



Intact Heavy Chain 



PAC-l 



Utilising methods set out hereinbefore the V-min construct in Figiire 
47 was produced. 
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SBQUENCg LI5TIKQ 
(2) INFORMATION FOR SEQ ID NO: 1 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQX7ENCE LENGTH: 25 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
ATC GAT GAA TTC CTG CAG CAT ATQ A 25 
(2) INFORMATION FOR SEQ ID NO: 2 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 62 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



GTA CTG CTA TCT ACT ACT GTG CAC GTA ACT ACT ATG GCT CCA CTT 
GGT ACT TCG ACG TTT GG 



45 
62 
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(2) INFORMATION FOR SEQ ID NO: 3 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 59 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNHSS : Single 

(D) TOPOLOOY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

GGT GCT GGT ACC ACT GTA GGC GGC TCC CTC CGT CTG TCT TGC GCA 
ACT AGT GGC TTC 
AC 59 

(2) INFORMATION FOR SEQ ID NO; 4 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 60 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



CTt CTC TGA CTT CTA CAT GGA GTG GGT ACG CCA GCC GCC GGG TAA 
ACG TCT CGA GTG GAT 



45 
60 
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(2) INFORMATION FOR SEQ ID NO: 5 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 61 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CGC AGC TAG CCQ TAA CAA AGO TAA CAA GTA TAC CAC CGA ATA CAG 45 
CGC TTC TGT TGA TAT C 61 



(2) INFORMATION FOR SEQ ID NO: 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 61 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 



GTC TGA CCC AGT CTA TGT CTT GCA AAT CTT CTC AGT CTC TGC TGA 
ACT CTG GTA ACC AGA A 61 



45 
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(2) INFORMATION FOR SEQ ID NO: 7 

(i) SEQUENCE CHARACTERISTICS: 



(A) SEQUENCE LENGTH: 61 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

AAA CTT CCT CGC GTG GTA TCA GCA AAA GCC TGG CCA GCC GCC GAA 45 
ACT CCT GAT CTA CGG T 61 

(2) INFORMATION FOR SEQ ID NO: 8 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 51 bases 

(B) SEQUENCE TYPE: Nucleotide 
<C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



GCG TCT ACQ CGT GAA TCT GGT GTT CCG GAC CGT TTC ACC GAA GAC 
CTG GCT 



45 
51 
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(2) INFORMATION FOR SEQ ID NO: 9 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 60 bases 

(B) SEQUENCE TTPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY 2 Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GTT TAC TAG TGT CAA AAT GAT CAC TCT TAC CCG CTG ACC TTT GGC 45 
GCC GGC ACC AAA CTG 60 

(2) INFORMATION FOR SEQ ID NO: 10 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 52 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



GAA CAG AAA CTG ATC TCT GAA GAA GAC CTG AAC TAA TAA TAC GTA 
AGA TCT G 52 



45 
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(2) INFORMATION FOR SEQ ID NO: 11 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 51 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 
<D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GAT CCA GAT CTT ACG TAT TAT TAG TTC AGG TCT TCT TCA GAG ATC 45 
AGT TTC 51 

(2) INFORMATION FOR SEQ ID NO: 12 
(i) SEQUENCE CHARACTERISTICS: 



(A) SEQUENCE XiENGTH: 59 bases 



(B) SEQUENCE TYPE: Nucleotide 



<C) STRANDEDNESS: Single 



(D) TOPOLOGY: Linear 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



TGT TCC AGT TTQ GTG CCG GCG CCA AAG GTC AGC GGG TAA GAG TGA 



45 



TCA TTT TGA CAG TA 



59 
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(2) INFORMATION FOR SEQ ID NO: 13 
(i) SEQUENCE CHARACTERISTICS; 

(A) SEQUENCE LENGTH: 62 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANIDHDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13; 

GTA AAC AGC CAG QTC TTC GGT GAA ACG GTC CGG AAC ACC AGA TTC 
ACG CGT AGA^ CGC ACC GT 62 

(2) INFORMATION FOR SEQ ID NO: 14 

(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 60 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



AGA TCA GGA GTT TCG 6CG GCT QQC CAG GCT TTT GCT GAT ACC ACG 
CGA GGA AGT TTT TCT 60 



45 
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(2) INFORMATION FOR SEQ ID NO: 15 
(i) SEQUENCE CHARACTERliSTICS : 

(A) SEQUENCE LENGTH: 53 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GGT TAC CAG AGT TCA OCA GAG ACT GAG AAG ATT TGC AAG ACA TAG 
ACT GGG TC 53 

(2) INFORMATION FOR SEQ ID NO: 16 
(i) SEQUENCE CHARACTERISTICS: 



<A) SEQUENCE LENGTH: 60 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

ATG ACG ATA TCA ACA GAA GCG CTG TAT TCG GTG GTA TAC TTG TTA 
CCT TTG TTA CGG CTA 60 
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(2) INFORMATION FOR SEQ ID NO: 17 
(i) SEQXJENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 61 bases 

(B) SEQUENCE TVPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GCT GCG ATC CAC TCG AGA CGT TTA CCC GGC GGC TGG CGT ACC CAC 
TCC ATGTAG AAG TCA G 61 

(2) INFORMATION FOR SEQ ID NO: 18 



(i) SEQUENCE CHAIIACTERISTICS : 

(A) SEQUENCE LENGTH: 58 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

AGA AGO TGA AGC CAC TAG TTG CGC AAG ACA GAC GGA GGG AGC CGC 
CTA CAG TGG TAG C 
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(2) INFORMATION FOR SEQ ID NO: 19 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: €3 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOX/5GY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

AGC ACC CCA AAC GTC GAA GTA CCA AGT GGA GCC ATA GTA GTT ACG 
TGC ACA GTA GTA GAT AGC 63 

(2) INFORMATION FOR SEQ ID NO: 20 
( i ) SEQUENCE CH3\RACTERISTICS : 

(A) SEQUENCE LENGTH: 34 bases 

(B) SEQUKHCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



AGT ACT CAT ATQ CTG CAG GAA TTC ATC GAT GTA C 



34 
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(2) INFORMATION FOR SEQ ID NO: 21 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 60 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNBSS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

GTA CTG GTA CTT ACT ACT GCA CTA CTC GCT TTG CAT ATT GGG GCC 
AAG GTA CTC TGG TTG 60 

(2) INFORMATION FOR SEQ ID NO : 22 

(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: S9 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNBSS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



GTG GTT CCA TGA AAC TCT CTT GCG CTG CCT CAG GCT TCA CTT TTA 
GCG ACG CCT GGA TG 



45 
59 
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(2} INFORMATION FOR SEQ ID NO: 23 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 53 bases 

(B) SEQUENCE TVPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOIiOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GAC TGG GTC CGT CAG TCT CCG GAG AAA GGT CTG GAA TGG QTQ GCC 
GAA ATT GGC AAC A 

(2) INFORMATION FOR SEQ ID NO: 24 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 46 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



AAG GTA ACA ACC ACG CAA CCT ACT ATG CTG AST CTG TTG ATA TCA 
A 46 



45 
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(2) INFORMATION FOR SEQ ID NO: 25 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH : 47 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNBSS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

AAT GAC CCA GTC TAC TAT CAC GTG TAA GGC TAG CCA GGA CAT CAA 45 

CA 47 . 

(2) INFORMATION FOR SEQ ID NO: 26 

(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 55 bases 

(B> SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 



OCT ACC TGA GCT GGT TCC AGC AGA AAC CCG GGA AAT CTC CGA AAA 
CAT TAA TCT A 



45 
55 
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(2) INFORMATION FOR SEQ ID NO: 27 
(i) SEQUENCE CHAI^CTERISTICS : 

(A) SEQUENCE LENGTH: 59 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

CCG TGC AAA CCG TCT GGT TGA TGG TGT ACC CTC GAG GTT CTC TGA 45 
AGA TAT GGG TAT TT 59 

(2) INFORMATION FOR SEQ ID NO: 28 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 56 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



ACT ACT GTC TAC AGT ATG ACG AGT TCC CGC GTA CCT TCG GTG GCG 
GCA CCA AGC TT 



45 

56 
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(2) INFORMATION FOR SEQ ID NO: 29 
(i) SEQUENCE CHARACTERISTICS : 

(A) SEQUENCE LENGTH: 55 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

TGT TCA AGC TTG GTG CCG CCA CCG AAG GTA CGC GGG AAC TCG TCA 
TAC TGT AGA C 



(2) INFORMATION FOR SEQ ID NO: 30 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 60 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



AGT AGT AAA TAC CCA TAT CTT CAG AGA ACC TCG AGG GTA CAC CAT 
CAA CCA GAC GGT TTG €0 



45 
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(2) IKPORMATION FOR SEQ ID NO: 31 
(i) SEQtJENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 56 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOI/OGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CAC GGT AGA TTA ATO TTT TCG GAG ATT TCC CGG GTT TCT 6CT GGA 45 
ACC AGC TCA GG 5^ 

(2) INFORMATION FOR SEQ ID NO: 32 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE X.ENGTH: 45 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNBSS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
TGA CTG TTG ATG TCC TGG CTA GCC TTA CAC GTG ATA GTA GAC TGG 45 
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(2) INFORMATION FOR SBQ ID NO: 33 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQtJENCE LENGTH: 48 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GTC ATT TTG ATA TCA ACA GAC TCA GCA TAG TAG GTT GCG TGG TTG 
TTA 48 

(2) INFORMATION FOR SEQ ID NO: 34 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: S6 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 
CD) TOPOLOGY: Linear 

(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 34: 



CCT TTG TTG CCA ATT TCG GCC ACC CAT TCC AGA CCT TTC TCC GGA 
GAC TGA CGG AC 



45 
56 



wo 94/12625 



- 97 - 



PCT/GB93/02375 



(2) INFORMATION FOR SEQ ID NO: 35 
(i) SEQUENCE CHARACTERISTICS; 

(A) SEQXXENCE LENQTH: 60 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Single 
<D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

CCA GTC CAT CCA GGC GTC GCT AAA AGT GAA GCC TGA GGC AGC GCA 45 
AGA GAG TTT CAT GGA 60 

(2) INFORMATION FOR SEQ ID NO: 36 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 60 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

ACC ACC AAC CAG AGT ACC TTG GCC CCA ATA TGC AAA GCG AGT AGT 45 
GCA GTA GTA AGT ACC 60 
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(2) INFORMATION FOR SEQ ID NO: 37 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 152 bases 

(B) SEQUENCE TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
AATTCTGGCA AATATTCTGA AATGAGCTGT TGACAATTAA TCATCGAACT AGTTAACTAG 60 
TACGCAAGTT CACQTAAAAA GGGTATCGAC AATGGTACCC GGGGATCCTC TAGAGTCGAC 120 
CTGCAG6CAT GCAAGCTTAG CCCGCCTAAT GAGCGGGCTT TTTTTTATCG AC 152 

(2) INFORMATION FOR SEQ ID NO: 38 
(i) SEQUENCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 66 bases 

(B) SEQUENCE TYPE: Nucleotide 

(C) STRANDEDNESS : Double (top Strand listed) 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
TCGACATTAT ATTACTAATT AATTGGGGAC CCTAGAGGTC CCCTTTTTTA TTTTAAAAAG 60 
CATGCA 66 
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(2) INFORMATION FOR SEQ ID NO: 39 
(i) SEQtreWCE CHARACTERISTICS: 

(A) SEQUENCE LENGTH: 210 amino acids 

(B) SEQUENCE TYPE: amino acid 
(D) TOPIiOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Asp lie Val Met Thr Gin Ser Pro Ser Ser Leu Ser Val Ser Ala Gly 
^ 5 10 15 

Glu Arg Val Thr Met Ser Cys Lys Ser Ser Gin Ser Leu Leu Asn Ser 

20 25 30 

Gly Asn Gin Lys Asn Phe Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin 

35 40 45 

Pro Pro Lys Leu Leu lie Tyr Gly Ala Ser Thr Arg Glu Ser Gly Val 

50 55 60 

Pro Asp Arg Phe Thr Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr 
^5 70 75 80 

lie Ser Ser Val Gin Ala Glu Asp Leu Ala Val Tyr Tyr Cys Gin Asn 

85 90 95 

Asp His Ser Tyr Pro Leu Thr Phe Gly Ala Gly Thr Lys Leu Glu lie 

100 105 110 

Lys Arg Ala Asp Ala Ala Pro Thr Val Ser He Phe Pro Pro Ser Ser 

105 110 115 

Glu Gin Leu Thr Ser Gly Gly Ala Ser Val Val Cys Phe Leu Asn Asn 

120 125 130 

Phe Tyr Pro Lys Asp He Asn Val Lys Trp Lys He Asp Gly Ser Glu 
135 140 145 150 

Arg Gin Asn Gly Val Leu Asn Ser Trp Thr Asp Gin Asp Ser Lys Asp 

155 160 165 
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Ser Thr Tyr Ser Met Ser Ser Tlir Leu Tiir l^eu Thr I*ys Asp Glu Tyr 

170 175 180 

Glu Arg His Asn Ser Tyr Thr Cys Glu Ala Thr His Lys Thr Ser Thr 

185 190 195 

Ser Pro lie Val Lys Ser Phe Asn Arg Asn Glu Cys 
200 205 210 



(2) INFORMATION FOR SBQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 

Asp lie Lys Met Thr Gin Ser Pro Ser Ser Met Tyr Ala Ser Leu Gly 

1 5 10 IS 

Glu Arg Val Thr lie Thr Cys Lys Ala Ser Gin Asp lie Asn Ser Tyr 

20 25 30 

Leu Ser Trp Phe Gin Gin Pro Gly Lys Ser Pro Lys Thr Leu lie Tyr 

35 40 45 

Arg Ala Asn Arg Leu Val Asp Gly Val Pro Ser Arg Phe Ser Gly Ser 

SO 55 60 

Gly Ser Gly Gin Asp Tyr Ser Leu Thr lie Ser Ser Leu Glu Tyr Glu 
65 70 75 80 

Asp Met Gly lie Tyr Tyr Cys Leu Gin Tyr Asp Glu Phe Pro Arg Thr 

as 90 95 

Phe Gly Gly Gly Thr Lys Leu Glu Asn Lys Arg Ala Asp Ala Ala Pro 

100 105 110 

Thr Val Ser lie Phe Pro Pro Ser Ser Glu Gin Leu Thr Ser Gly Gly 

115 120 125 

Ala Ser Val Val Cys Phe Leu Asn Asn Phe Tyr Pro Lys Asp lie Asn 
130 135 140 
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Ala Lys Trp Lys lie Asp Gly Ser Glu Arg Gin Asn Gly Val Leu Asn 
"5 3.50 155 160 

Ser Trp Thr Asp Gin Asp Ser Lys Asp Ser Thr Tyr Ser Met Ser Ser 

1^5 170 175 

Thr Leu Ser Leu Thr Leu Thr Lys Asp Glu Tyr Glu Arg His Asn Ser 

180 185 X90 

Tyr Thr Cys Glu Ala Thr His Lys Thr Ser Thr Ser Pro He Val Lys 

200 205 

Ser Phe Asn Arg Asn Glu Cys 
210 220 



(2) INFORMATION FOR SHQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amdLno acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 

Glu Val Lys Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 

^ 5 10 15 

Ser Leu Arg Leu Ser Cys Ala Thr Ser Gly Phe Thr Phe Ser Asp Phe 

20 25 30 

Tyr Met Glu Trp Val Arg Gin Pro Pro Gly Lys Arg Leu Glu Trp He 

35 40 45 

Ala Ala Ser Arg Asn Lys Gly Asn Lys Tyr Thr Thr Glu Tyr Ser Ala 

50 55 SO 

Ser Val Lys Gly Arg Phe He Val Ser Arg Asp Thr Ser Gin Ser He 
^5 70 75 80 

Leu Tyr Leu Gin Met Asn Ala Leu Arg Ala Glu Asp Thr Ala He Tyr 

85 90 95 

Tyr Cys Ala Arg Asn Tyr Tyr Gly Ser Thr Trp .Tyr Phe Asp Val Trp 
100 105 105 
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Gly Ala Gly Thr Thx Val Thr Val Ser Ser Glu Ser Ala Arg Asn Pro 

110 120 125 

Thr lie Tyr Pro Leu Thr Leu Pro Pro Ala Leu Ser Ser Asp Pro Val 
130 135 140 

He He Gly cys Leu He His Asp Tyr Phe Pro Ser Gly Thr Met Asn 
' ISO 155 160 

Val Thr Trp Gly Lys Ser Gly Lys Asp He Thr Thr Val Asn Phe Pro 

165 170 175 

Pro Ala Leu Ala Ser Gly Gly Arg Tyr Thr Met Ser Asn Gin Leu Thr 

180 185 190 

Leu Pro Ala Val Glu Cys Pro Glu Gly Glu Ser Val Lys Cys Ser Val 

195 200 205 

Gin His Asp Ser Asn Pro Val Gin Glu Leu Asp Vai Asn Cys 
210 215 220 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 217 amdLno acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 42 

Gin Val Lys Leu Glu Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 
^ 5 10 15 ■ 

Ser Met Lys Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Asp Ala 

20 25 30 

Trp Met Asp Trp Val Arg Gin Ser Pro Glu Lys Gly Leu Glu Trp Val 

35 40 45 

Ala Glu He Gly Asn Lys Gly Asn Asn His Ala Thr Tyr Tyr Ala Glu 

50 ss 
Ser Val Lys Gly Arg Phe Thr Val Ser Arg Asp Thr Ser Lys Ser Arg 
^5 70 75 . 90 
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Val Tyx Leu Gin Met Asn Ser Leu Arg Val Glu Asp Thr Gly Thr Tyr 

85 90 95 

Tyr Cys Thr Thr Arg Phe Ala Tyr Trp Gly Gin Gly Thr Leu Val Thr 

10.0 105 110 

TVal Ser Ala Ala Lys Thr Thr Pro Pro Ser Val Tyr Pro Leu Ala Pro 

1X5 120 125 

Gly Ser Ala Ala Gin Thr Asn Ser Met Val Thr Leu Gly Cys Leu Val 

130 135 140 

Lys Gly Tyr Phe Pro Glu Pro Val Thr Val Thr Trp Asn Ser Gly Ser 
145 150 155 160 

Leu Ser Ser Gly Val His Thr Phe Pro Ala Val Leu Gin Ser Asp Leu 

165 170 175 

Tyr Thr Leu Ser Ser Ser Val Thr Val Pro Ser Ser Thr Trp Pro Ser 

180 185 190 

Glu Thr Val Thr Cys Asn Val Ala His Pro Ala Ser Ser Thr Lys Val 

195 200 205 

Asp Lys Lys lie Val Pro Arg Asp Cys 
210 215 

(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 169 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 

Thr Ala He Tyr Tyr Cys Ala Arg Asn Tyr Tyr Gly Ser Thr Trp Tyr 

1 5 10 15 

Phe Asp Val Trp Gly Ala Gly Thr Thr Val Gly Gly Ser Leu Arg Leu 



Ser Cys Ala Thr Ser Gly Phe Thr Phe Ser Asp Phe Tyr Met Glu Trp 



20 



25 



30 



35 



40 



45 
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Val Arg Gin Pro Pro Gly Uys Arg Leu Glu Trp lie Ala Ala Ser Arg 

50 55 60 

Asn Lys Gly Asn Lys Tyr Thr Thr Glu Tyr Ser Ala Ser Val Asp , lie 
^5 70 75 80 

Val Met Thr Gin Ser Met Ser Cys Lys Ser Ser Gin Ser Leu Leu Asn 

85 90 95 

Ser Gly Asn Gin Lys Asn Phe Leu Ala Trp Tyr Gin Gin Lys Pro Gly 

100 105 110 

Gin Pro Pro Lys Leu Leu He Tyr Gly Ala Ser Thr Arg Glu Ser Gly 

115 120 125 

Val Pro Asp Arg Phe Thr Glu Asp Leu Ala Val Tyr Tyr Cys Gin Asn 

130 13S 
Asp His Ser Tyr Pro Leu Thr Phe Gly Ala Gly Thr Lys Leu Glu Gin 
"5 ISO 155 160 

Lys Leu He Ser Glu Glu Asp Leu Asn 

165 

(2) INFORMATION FOR SEQ ID NO: 44: 
(±) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 157 amino acxds 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 

Thr Gly Thr Tyr Tyr Cys Thr Thr Arg Phe Ala Tyr Trp Gly Gin Gly 

15 10 15 

Thr Leu Val Gly Gly Ser Met Lys Leu Ser Cys Ala Ala Ser Gly Phe 

20 25 30 

Thr Phe Ser Asp Ala Trp Met Asp Trp Val Arg Gin Ser Pro Glu Lys 

35 40 45 

Gly Leu Glu Trp Val Ala Glu lie Gly Asn Lys Gly Asn Asn His Ala 
50 55 60 



wo 94/12625 



105 - 



PCT/GB93/02375 



Thr Tyr Tyr Ala Glu Ser Val Asp He Lys Met Thr Gin Ser He Thr 
^5 70 75 80 

Cys Lys Ala Ser Gin Asp He Asn Ser Tyr Leu Ser Trp Phe Gin Gin 

85 90 95 

Lye Pro Gly Lys Ser Pro Lys Thr Leu He Tyr Arg Ala Asn Arg Leu 

100 X05 110 

Val Asp Gly Val Pro Ser Arg Phe Ser Glu Asp Met Gly He Tyr Tyr 

115 120 125 

Cys Leu Gin Tyr Asp Glu Phe Pro Arg Thr Phe Gly Gly Gly Thr Lys 

130 135 140 

Leu Glu Gin Lys Leu He Ser Glu Glu Asp Leu Asn End 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

<A) I*ENGTH: 211 amino acids 

<B) TYPE: protein 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(xi) SEC2UENCE DESCRIPTION SEQ ID NO: 45: 

r 

1 Ser Arg Leu Ser He Ser Lys Asp Asn Ser Lys Ser Gin Val Phe Leu 16 

17 Lys Met Asn Ser Leu His Thr Asp Asp Thr Ala Arg Tyr Tyr Cys Ala 32 

33 Arg Glu Arg Asp Tyr Arg Leu Asp Tyr Trp Gly Gin Gly Thr Thr Leu 48 

49 Thr Val Ser Ala Pro Ser Gin Ser Leu Ser He Thr Cys Thr Val Ser 64 

65 Gly Phe Ser Leu Thr Gly Tyr Gly Val Asn Trp Val Arg Gin Pro Pro 80 

81 Gly Lys Gly Leu Glu Trp Leu Gly Met He Trp Gly Asp Gly Asn Thr 96 

97 Asp Tyr Asn Ser Ala Leu Lys Asp lie. Gin Met Thr Gin Ser Pro Ala 112 

113 Ser Leu Ser Ala Ser Val Gly Glu Thr Val Thr He Thr Cys Arg Ala 128 

129 Ser Gly Asn He His Asn Tyr Leu Ala Trp Tyr Gin Gin Lys Gin Gly 144 
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145 



Lys Ser Pro Gin Leu Leu Val Tyr Tyr 



Thr Thr Tlir Leu Ala Asp Gly 



161 Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Gin Tyr Ser Leu 
177 Lys lie Asn Ser Leu Gin Pro Glu Asp Phe Gly Ser Tyr Tyr Cys Gin 
193 His Phe Trp Ser Thr Pro Arg Thr Phe Gly Gly Gly Thr Lys Leu Glu 
209 lie Lys Arg 211 
(2) INFORMATION FOR SEQ ID NO:46: 
(i) SEQUEKTCB CHARACTERISTICS: 

(A) LENGTH: 39 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 46: 
1 GACTATCCCGGGTCTTCTGGCTCCCGTCTGAGCATCAGC 39 

(2) INFORMATION FOR SEQ ID NO: 47: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linesur 

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 47: 
1 GGCACCACGCTGACCGTCTCCGCGCCCTCACAGAGCCTGTCCATC 45 

(2) INFORMATION FOR SEQ ID NO: 48; 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 46 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION SEQ ID NO: 48: 



CAGACTATAATTCAGCTCTCAAAGACATCCAaATGACCCAGTCTCC 



46 
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(2) INFORMATION FOR SEQ ID NO:49: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45. nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 49: 

L GATGGACAGGCTCTGTGAGGGCGCGGAGACGGTCAGCGTGGTGCC 45 
(2) INFORMATION FOR SEQ ID NO: 50: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION SEQ ID NO:S0: 
L <3GW^GACTGGGTCATCTGGATGTCTTTGAGAGCTGJUITTATAGTCTG 46 



(2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 51: 



CGACGGCCAGTGAATTCGAGCTGG 24 
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(2) INFORMATION FOR SEQ ID NO: 52: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 52: 

1 GCAACTAGTTAACTAGTACGCAAGTTCACGTAAAAAGGRWWWPWWWWATGTCCCGTCT^ 60 
61 GCATCAGCAAGG 72 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION SEQ ID NO: 53: 

1 GAC TAT CCC GGG TCT TCT GGC TCC CGT CTG AGC ATC AGC AAG GAC AAC 48 
1 Asp Tyr Pro Qly Ser Ser Gly Ser Arg Leu Ser He Ser Lys Asp Asn 16 

49 TCC AAa AGC CAA GTT TTC TTA AAA ATG AAC AGT CTG CAC ACT GAT GAC 96 
17 Ser Lys Ser Gin Val Phe Leu Lys Met Asn Ser Leu His Thr Asp Asp 32 

97 ACA GCC AGG TAC TAC TGT GCC AGA GAG AGA GAT TAT AGG CTT GAC TAC 144 
33 Thr Ala Arg Tyr Tyr Cys Ala Arg Glu Arg Asp Tyr Arg Leu Asp Tyr 48 

145 TGG GGC CAA GGC ACC ACG CTG ACC GTC TCC GCG CCC TCA CAG AGC CTG 192 



49 Trp Gly Gin Gly Thr Thr Leu Thr Val Ser Ala Pro Ser Gin Ser Leu 



64 



193 TCC ATC ACA TGC ACC GTC TCA GGG TTC TCA TTA ACC GGC TAT GGT GTA 240 
65 Ser He Thr Cys Thr Val Ser Gly Phe Ser Leu Thr Gly Tyr Gly Val 80 
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241 AAC TGG GTT CGC CAG CCT CCA GGA AAG GGT CTG GAG TGG CTG GGA ATG 288 

81 Asn Trp Val Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp Leu Gly Met 96 

289 ATT TGG GGT GAT GGA AAC ACA GAC TAT AAT TCA GCT CTC AAA GAC ATC 336 

97 He Trp Gly Asp Gly Asn Thr Asp Tyr Asn Ser Ala Leu Lys Asp He 112 

337 CAG ATG ACC CAG TCT CCA GCC TCC CTT TCT GCG TCT GTG GGA GAA ACT 384 

113 Gin Met Thr Gin Ser Pro Ala Ser Leu Ser Ala Ser Val Gly Glu Thr 128 

385 GTC ACC ATC ACA TGT CGA GCA AGT GGG AAT ATT CAC AAT TAT TTA GCA 432 

129 Val Thr He Thr Cys Arg Ala Ser Gly Asn He His Asn Tyr Leu Ala 144 

433 TGG TAT CAG CAG AAA CAG GGA AAA TCT CCT CAG CTC CTG GTC TAT TAT 480 

145 Trp Tyr Gin Gin Lys Gin Gly Lys Ser Pro Gin Leu Leu Val Tyr Tyr 160 

481 ACA ACA ACC TTA GCA GAT GGT GTG CCA TCA AGG TTC AGT GGC AGT GGA 528 

161 Thr Thr Thr Leu Ala Asp Gly Val Pro Ser Arg Phe Ser Gly Ser Gly 176 

529 TCA GGA ACA CAA TAT TCT CTC AAG ATC AAC AGC CTG CAA CCT GAA GAT 576 

177 Ser Gly Thr Gin Tyr Ser Leu Lys He Asn Ser Leu Gin Pro Glu Asp 192 

577 TTT GGG AGT TAT TAG TGT CAA CAT TTT TGG AGT ACT CCT CGG ACG TTC 624 

193 Phe Gly Ser Tyr Tyr Cys Gin His Phe Trp Ser Thr Pro Arg Thr Phe 208 

625 GGT GGA GGG ACC AAG CTC GAG ATC AAA CGG GAA CAA AAA CTC ATC TCA 672 

209 Gly Gly Gly Thr Lys Leu Glu He Lys Arg Glu Gin Lys Leu He Ser 224 

673 GAA GAG GAT CTG AAT TAA TAA TGA TCA AAC GGT AAT AAG GAT CCA GCT 720 

225 Glu Glu Asp Leu Asn 240 



721 



CGA ATT CAC TGG CCG TCG 738 
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(2) INFORMATION FOR SEQ ID NO: 54: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 244 amdLno acids 

(B) TYPE: protein 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQXJENCE DESCRIPTION SEQ ID NO: 54: 

1 Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 16 

17 Ala Gin Pro Ala Met Ala Ser Arg Leu Ser lie Ser Lys Asp Asn Ser 32 

33 Lys Ser Gin Val Phe Leu Lys Met Asn Ser Leu His Thr Asp Asp Thr 48 

49 Ala Arg Tyr Tyr Cys Ala Arg Glu Arg Asp Tyr Arg Leu Asp Tyr Trp -64 

65 Gly Gin Gly Thr Thr Leu Thr Val Ser Ala Pro Ser Gin Ser Leu Ser 80 

81 He Thr Cys Thr Val Ser Gly Phe Ser Leu Thr Gly Tyr Gly Val Asn 96 

97 Trp Val Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp Leu Gly Met He 112 

113 Trp Gly Asp Gly Asn Thr Asp Tyr Asn Ser Ala Leu Lys Asp He Gin 128 

129 Met Thr Qln Ser Pro Ala Ser Leu Ser Ala Ser Val Gly Glu Thr Val 144 

145 Thr He Thr Cys Arg Ala Ser Gly Asn He His Asn Tyr Leu Ala Trp 160 

161 Tyr Gin Gin Lys Gin Gly Lys Ser Pro Gin Leu Leu Val Tyr Tyr Thr 176 
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177 Thr .Thr Leu Ala Asp Gly Val Pro Ser Arg Plie Ser Gly Ser Gly Ser X92 
193 Gly Thr Gin Tyr Ser Leu Lys He Asn Ser Leu Gin Pro Glu Asp Phe 208 
209 Gly Ser Tyr Tyr Cys Gin His Phe Trp Ser Thr Pro Arg Thr Phe Gly 224 
225 Gly Gly Thr Lys Leu Glu lie Lys Arg Glu Gin Lys Leu He Ser Glu 240 
241 Glu Asp Leu Asn 244 

(2) INFORMATION FOR S^ ID NO:55 
(i) SEQUENCE C3IARACTERISTICS : 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 55: 

Ser Arg Leu Ser He Ser Lys Asp Asn Ser Lys Ser Gin Val Phe Leu 
^ 5 10 15 

Met Asn Ser Leu His Thr Asp Asp Thr Ala Arg Tyr Tyr Cys Ala Arg 

20 25 30 

Glu Arg Asp Tyr Arg Leu Asp Tyr Trp Gly Gin Gly Tyr Thr Leu Thr 

35 40 45 

Val Ser Ala Pro Ser Gin Ser Leu Ser He Thr Cys Thr Val Ser Gly 

50 55 60 

Phe Ser Leu Thr Gly Tyr Gly Val Asn Trp Val Arg Gin Pro Pro Gly 
65 70 75 80 

Lys Gly Leu Glu Trp Leu Gly Met He Trp Gly Asp Gly Asn Thr Asp 

85 90 95 

Tyr Asn Ser Ala Leu Lys Asp He Gin Met Thr Gin Ser Pro Ala Ser 
100 105 110 
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Leu Ser Ala Ser Val Gly Glu Thr Val Thr lie Thr Cys Arg Ala Ser 

115 120 125 

Gly Asn lie His Asn Tyr Leu Ala Trp Tyr Gin Gin Lys Gin Gly Lys 

135 140 145 

Ser Pro Gin Leu Leu Val Tyr Tyr Thr Thr Thr Leu Ala Asp Gly Val 

150 155 160 

Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Gin Tyr Ser Leu Lys 

165 170 175 

He Asn Ser Leu Gin Pro Glu Asp Phe Gly Ser Tyr Tyr Cya Gin His 

180 185 190 

Phe Trp Ser Tlir Pro Arg Thr Phe Gly Gly Gly Thr Lys Leu Glu He 

195 200 205 

Lys Arg 
210 



(2) INFORMATION FOR SEQ ID NO: 56 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 

(xi) SEQDENCE DESCRIPTION: SEQ ID NO 56: 

Arg Phe Ser Gly Ser Gly Ser Gly Thr Gin Tyr Ser Leu Lys He Asn 

^5 10 15 

Ser Leu Gin Pro Glu Asp Phe Gly Ser Tyr Thr Cys Gin His Phe Trp 

20 25 30 

Ser Thr Pro Arg Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lys Ala 

35 40 45 

Ser Val Gly Glu Thr Val Thr He Thr Cys Arg Ala Ser Gly Ash He 
50 55 QQ 

His Asn Tyr Leu Ala Trp Tyr Gin Gin Lys Gin Gly Lys Ser Pro Gin 
^5 70 75 80 
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Leu Leu Val Tyr Tyx Thr Thr Thr Leu Ala Asp Gly Val Pro Ser Gin 

85 90 95 

Val Gin Leu Lys Glu Ser Gly Pro Gly Leu Val Ala Pro Ser Glh Ser 

100 105 110 

Leu Ser lie Thr Cys Thr Val Ser Gly Phe Ser Leu Thr Gly Tyr Gly 

115 120 125 

Val Asn Trp Val Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp Leu Gly 

130 135 140 

Met He Trp Gly Asp Gly Asn Thr Asp Tyr Asn Ser Ala Leu Lys Ser 
"5 150 155 160 

Arg Leu Ser He Ser Lys Asp Asn Ser Lys Ser Gin Val Phe Leu Lys 

165 170 175 

Met Asn Ser Leu His Thr Asp, Asp Thr Ala Arg Tyr Tyr Cys Ala Arg 

180 185 , 190 

Glu Arg Asp Tyr Arg Leu Asp Tyr Trp Gly Gin Gly Thr Thr Leu Thr 

195 200 205 

Val Ser Ser 
210 



(2) INFORMATION FOR SEQ ID NO;57 
(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGV: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO 57 1 



Thr Asp Asp Thr Als Arg Tyr Tyr Cys Ala Arg Glu Arg Asp Tyr Arg 
1 5 10 15 

Leu Asp Tyr Trp Gly Gin Gly Thr Thr Leu Thr Val Ser Ala Pro Ser 
20 25 30 
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Gin Sex Leu Ser lie Thr Cys Tlir Val Ser Gly Phe Ser Leu Thr Gly 

35 40 45 

Tyr Gly Val Asn Trp Val Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp 

50 55 60 

Leu Gly Met lie Trp Gly Asp Gly Asn Thr Asp Tyr Asn Ser Ala Leu 
^5 70 75 .80 

Lys Asp lie Gin Met Thr Gin Ser lie Thr Cys Arg Ala Ser Gly Asn 

85 90 95 

He His Asn Tyr Leu Ala Trp Tyr Gin Gin Lys Gin Gly Lys Ser Pro 

100 105 110 

Gin Leu Leu Val Tyr Try Thr Thr Thr Leu Ala Asp Gly Val Pro Ser 

115 120 125 

Arg Asn Ser Leu Gin Pro Glu Asp Phe Gly Ser Tyr Try Cys Gin His 
"0 135 140 145 

Phe Trp Ser Thr Pro Arg Thr Phe Gly Gly Gly Thr Lys Leu Glu He 

ISO 155 160 

Lys Arg 



(2) INFORMATION FOR SEQ ID NO: 58 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 249 amdLno acids 

(B) TVPE: amino acids 
(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: 

Arg Phe Thr He Ser Arg Asn Asp 
1 5 
Met Asp Ser Leu Arg Pro Glu Asp 
20 



SEQ ID NO 58: 

Ser Lys Asn Thr Leu Phe Leu Gin 

10 15 
Thr Gly Val Tyr Phe Cys Ala Arg 
25 30 
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Asp Gly Qly His Gly Phe Cys Ser Ser Ala Ser Cys Phe Gly Pro Asp 

35 40 45 

Tyr Trp Gly C31n Gly Thr Pro Val Tfar Val Ser Gin Pro Gly Arg Ser 

50 55 €0 

Leu Arg Leu Ser Cys Ser Ser Ser Gly Phe lie Phe Ser Ser Tyr Ala 

70 75 80 

Met Tyr Trp Val Arg Gin Ala Pro Gly Lys Qly Leu Glu Trp Val Ala 

85 90 95 

lie lie Trp Asp Asp Gly Ser Asp Gin His Tyr Ala Asp Ser Val Lys 

100 105 110 

Gly Gin Ser Val Leu Thr Gin Pro Pro Ser Ala Ser Gly Thr Pro Gly 

13.5 120 125 

Gin Arg Val Thr lie Ser Cys Ser Gly Thr Ser Ser Asn He Gly Ser 
"0 135 3^40 

Ser Thr Val Asn Trp Tyr Gin Gin Leu Pro Gly Met Ala Pro Lys Leu 

ISO 155 160 

Leu He Tyr Arg Asp Ala Met Arg Pro Ser Gly Val Pro Asp Arg Phe 

170 175 
Ser Gly Ser Lys Ser Qly Ala Ser Ala Ser Leu Ala He Gly Gly Leu 

180 135 ISO 

Gin. Ser Glu Asp Glu Thr Asp Tyr Tyr Cys Ala Ala Trp Asp Val Ser 

199 200 205 

Leu Asn Ala Tyr Val Phe Gly Thr Gly Thr Lys Val Thr Val Leu Gly 
210 215 220 

Gin 
225 
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(2) INFORMATION FOR SEQ ID NO:S9 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 226 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO 59 j 



Arg Phe Ser Gly Ser Lys Ser Gly Ala Ser Ala Ser Leu Ala He Gly 

1 5 10 15 

Gly Leu Gin Ser Glu Asp Glu Thr Asp Tyr Tyr Cys Ala Ala Trp Asp 

20 25 30 

Val Ser Leu Asn Ala Tyr Val Phe Gly Thr Gly Thr Lys Val Gly Thr 

35 40 , 45 

Pro Gly Gin Arg Val Thr lie Ser Cys Ser Gly Thr Ser Ser Asn He 

50 55 SO 

Gly Ser Ser Thr Val Asn Trp Tyr Gin Gin Leu Pro Gly Met Ala Pro 
65 . 70 75 80 

Lys Leu Leu He Tyr Arg Asp Ala Met Arg Pro Ser Gly Val Pro Asp 

85 90 95 

Glu Val Gin Leu Val Gin Ser Gly Gly Gly Val Val Gin Pro Gly Arg 

100 105 110 

Ser Leu Arg Leu Ser Cys Ser Ser Ser Gly Phe He Phe Ser Ser Tyr 

120 125 130 

Ala Met Tyr Trp Val Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val 

135 140 145 

Ala He He Trp Asp Asp Gly Ser Asp Gin His Tyr Ala Asp Ser Val 
150 155 160 165 

Lys Gly Arg Phe Thr He Ser Arg Asn Asp Ser Lys Asn Thr Leu Phe 

170 175 ISO 

Leu Gin Met Asp Ser Leu Arg Pro Glu Asp Thr Gly Val Tyr Phe Cys 
185 190 195 
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Ala Arg Asp Gly Gly His Gly Phe Cys Ser Ser Ala Ser Cys Phe Gly 

200 205 210 

Pro Asp Tyr Trp Gly Gin Gly Thr Pro Val Thr Val Ser 
215 220 225 

(2) INFORMATION FOR SEQ ID NO:60 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY I linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO €0i 

Arg Leu Ser lie Ser Lys Asp Asn Ser Lys Ser Gin Val Phe Phe Lys 

1 5 10 15 

Met Asn Ser Leu Gin Ala Asn Asp Thr Ala lie Tyr Tyr Cys Ala Arg 

20 25 30 

Arg Ser Pro Ser Tyr Tyr Arg Tyr Asp Gly Ala Gly Pro Tyr Tyr Ala 

35 40 45 

Met Asp Tyr Trp Gly Gin Pro Ser Gin Ser Leu Ser lie Thr Cys Thr 

50 55 60 

Val Ser Gly Phe Ser Leu Thr Ser Tyr Gly Val His Trp Val Arg Gin 
^5 70 80 85 

Ser Pro Gly Lys Gly Leu Glu Trp Leu Gly Val He Trp Ser Gly Gly 

90 35 100 

Ser Thr Asp Tyr Asn Ala Ala Phe He Ser Asp Val Leu Met Thr Gin 

105 110 115 

Gin Thr Pro Leu Ser Leu Pro Val Ser I*eu Gly Asp Gin Ala Ser 11^ 

120 125 130 

Ser Cys Arg Ser Ser Gin Ser He Val His Ser Asn Gly Asn Thr Tyr 

135 140 145 

Leu Glu Trp Tyr Leu Gin Lys Pro Gly Gin Ser Pro Lys Leu Leu He 
150 155 160 1S5 
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Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Asp Arg Phe Ser Gly 

170 175 .180 

. ser Gly Ser Gly Thr Asp Phe Thr Leu Lys lie Ser Arg Val Glu Ala 
185 190 195 

Glu Asp Leu Gly Val Tyr Tyr Cys Phe Gin Gly Ser His Val Pro Tyr 

200 205 210 

Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lys 
215 220 



(61) INFORMATION FOR SEQ ID NO: 61 
(i) SEQXJENCE CHARACTERISTICS: 



(A) . LENGTH: 22B amino acids 

(B) TYPE: amino acids 
(D) TOPOLOGY: linear 

(xi) SEQX3ENCE DESCRIPTION: SEQ ID NO 61; 



Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Lys He Ser 

^ 5 10 15 

Arg Val Glu Ala Glu Asp Leu Gly Val Tyr Tyr Cys Phe Gin Gly Ser 

20 25 30 

His Val Pro Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu He Pro Val 
35 40 45 

Ser Leu Gly Asp Gin Ala Ser He Ser Cys Arg Ser Ser Gin Ser He 
50 55 

Val His Ser Asn Gly Asn Thr Tyr Leu Glu Trp Tyr Leu Gin Lys Pro 
®5 70 75 80 

Gly Gin Ser Pro Lys Leu Leu He Tyr Lys Val Ser Asn Arg Phe Ser 

85 90 95 

Gly Val Pro Asp Gin Val Gin Leu Lys Gin Ser Gly Pro Gly Leu Val 

100 105 110 

Gin Pro Ser Gin Ser Leu Ser He Thr Cys Thr Val Ser Gly Phe Ser 
115 120 125 
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Leu Thr ser Tyr Gly Val His Trp Val Arg Gin Ser Pro Gly Lys Qly 
"0 

Leu Glu Trp Leu Gly Val lie Trp Ser Gly Gly Ser Thr Aap Tyr Asn 

ISO ^gj, 
Ala Ala Phe lie Ser Arg Leu Ser lie Ser Lys Asp Asn Ser Lys Ser 

165 170 j^75 

Gin val Phe Phe Lys Met Asn Ser Leu Gin Ala Asn Asp Thr Ala lie 
180 135 

Tyr Tyr cys Ala Arg Arg Ser Pro Ser Tyr Tyr Arg Tyr Asp Gly Ala 

"5 200 205 

Sly Pro Tyr Tyr Ala Met Asp Tyr Trp Gly Gin Gly Thr Ser Val 
210 220 225 
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Table la 

Ca-Cof distances for linker sections of MPC Fv fragment: 
L55-84 and 1,104-107 versus Ll-21 Canonical numbering 

Distances up to 17A reported 

Light Chain 

10 11 12 13 14 15 16 17 18 19 20 21 
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14 
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17 
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17 
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17 
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1 T 

JU / 


X^ 


X2 


10 
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11 


14 


16 
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X3 


X J 


XU 
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10 


11 


13 


15 


17 


17 


72 
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16 


12 


11 
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11 


13 
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11 


12 


13 


14 


13 


74 
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14 


13 


13 


15 


14 


12 


75 
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14 


15 


15 


12 


76 
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14 
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11 
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13 
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13 
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14 
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13 


11 


83 
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13 


11 


10 


84 












14 


15 


14 


12 


10 


10 


104 










17 


13 


11 


10 
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105 












16 


14 


12 


11 
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106 














17 


15 


14 


11 
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107 
















16 


16 


12 


10 



17 17 16 17 17 

15 14 15 14 15 16 17 

16 14 15 13 14 14 16 
17 15 15 12 11 11 9 10 12 14 
17 15 17 14 13 12 10 10 10 11 

17 16 16 14 11 11 9 11 

16 13 11 9 10 
15 13 10 11 
16 12 12 
16 15 
. 17 
16 14 
16 12 11 
17 14 10 9 
17 17 16 13 10 6 6 

15 15 16 16 14 11 8 5 6 

14 13 15 13 13 10 7 5 4 7 

13 11 13 10 10 8 6 6 7 9 

14 12 12 10 8 7 5 7 9 12 

13 10 10 6 4 5 5 8 11 14 
10 8 7 4 5 5 6 8 11 13 
12 10 9 5 6 8 10 11 14 17 
12 10 9 7 9 11 13 14 17 17 

14 13 13 10 12 14 15 15 17 17 
12 11 11 9 11 12 12 11 13 13 
10 10 11 10 13 13 14 11 14 14 



6 7 10 10 13 12 12 9 11 10 

5 6 8 9 13 12 13 12 13 13 

6 5 5 6 10 10 12 12 14 15 
6 5 5 7 11 11 13 13 16 17 
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Table lb 

Cof-Ca distances for linker sections of MPC Pv fragment 
H60-88 versus Ll-21 Canonical numbering 

Distances up to 17A reported 
Light Chain 

^ 2 3 ^ 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 

Heavy 

60 10 13 14 16 

61 10 14 15 17 

62 12 15 16 
.63 15 

64 15 

65 

66 

67 

68 

69 

70 

71 

72 

73 

74 

75 

76 

77 

78 

79 

80 

81 

82 

63 

84 

85 

86 

87 

88 
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Table Ic 



Distances up to 17A reported 
Heavy Chain 

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 

Heavy 

60 17 
61 

62 16 16 17 16 17 17 

63 17 17 17 15 14 15 13 14 14 14 17 

64 17 16 17 15 16 16 16 

65 16 14 15 13 15 15 17 

66 ' 17 17 15 15 13 11 11 10 11 12 14 16 

67 16 16 14 15 17 17 15 16 14 13 12 10 10 10 11 13 
68 16 16 15 16 17 17 15 14 11 11 9 10 12 

69 16 16 14 15 14 16 17 14 12 10 10 10 

70 16 16 14 15 15 15 14 10 11 10 

71 17 16 14 14 13 14 15 15 12 11 9 

72 17 14 15 14 15 16 17 13 13 10 

73 17 16 15 16 16 17 16 16 13 

74 17 17 16 17 17 14 

75 17 16 14 13 13 14 15 17 16 15 11 

76 15 12 11 10 11 12 14 16 15 14 10 

77 16 14 12 9 9 9 11 13 16 15 12 10 7 

78 17 15 13 10 10 9 11 12 15 17 14 10 9 6 

79 16 12 12 9 10 10 12 16 17 17 13 10 7 6 5 

80 14 13 10 10 10 11 15 16 15 17 17 16 14 10 B 6 5 6 

81 16 13 12 10 12 14 15 13 15 15 13 11 7 6 4 6 9 

82 14 13 11 11 13 13 11 12 12 10 8 5 5 6 8 11 

83 17 14 14 15 14 11 11 10 7 6 5 7 9 11 14 

84 16 15 15 13 10 9 7 4 6 6 9 12 14 

85 17 14 12 12 11 8 7 5 5 6 7 9 12 13 17 

86 ^ 17 15 14 12 9 8 5 7 9 10 12 15 16 

87 17 14 13 11 10 8 7 10 11 12 14 17 17 

88 15 14 13 12 12 10 12 13 14 15 17 17 

88 17 16 14 12 12 11 10 11 10 11 12 11 11 14 13 16 

88 16 15 13 10 9 9 10 11 11 13 13 12 12 14 13 16 

88 17 14 14 13 10 10 11 12 14 14 16 15 14 13 15 12 15 



108 16 13 10 9 8 5 6 9 8 14 IS 17 15 14 11 12 10 12 

109 15 12 10 8 5 5 6 8 10 12 13 12 10 9 11 9 12 

110 15 14 11 8 6 5 7 9 10 12 11 11 11 14 13 16 

111 16 13 10 8 6 5 6 6 9 9 10 10 14 14 17 

112 16 12 10 6 6 5 6 10 10 12 13 17 17 

113 15 12 9 8 5 6 10 10 13 15 
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Table Id : Ca-Cof distances for linker sections of MPC Fv fragment 
L5S-84 versus HI -21 Canonical nximbering 

Distances up to 17A reported 
Heavy Chain 

1 2 3 4 5 e 1 8 9 10 11 12 13 14 15 16 17 18 19 20 21 



Light 
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17 


15 


17 
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Table le 

Ca-Ca distances for linker sections of MPC Fv fragment 
IiS5-84 versus ia04-107 H60-88 versus H108-113 Canonical 

numbering 

Distances up to 17A reported 



Ligbt 

55 

56 

57 

58 

59 

60 

61 

62 

63 

64 

65 

66 

67 

68 

69 
. 70 

71 



Light Cbain 
104 105 106 107 



17 
17 



15 
13 
16 



17 
16 



16 
16 



72 


15 








73 


12 


16 


17 




74 


12 


15 


15 




75 


10 


13 


13 


16 


76 


13 


15, 


14 


16 


77 


12 


13 


12 


14 


78 


9 


10 
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11 


79 


10 


10 


8 


11 


80 
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7 


10 
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10 


10 
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13 
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9 


12 


83 
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6 


7 


11 
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6 


8 


10 


14 



Heavy 
60 
61 
62 
63 
64 
65 
66 
67 
68 
69 
70 
71 
72 
73 
74 
75 
76 
77 
78 
79 
80 
81 
82 
83 
84 
85 
86 
87 
88 
88 
88 
88 



Heavy Chain 
108 109 110 111 112 113 



16 17 15 
17 15 16 15 
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Table If ; Ca-Ca distances for linker sections of MPC Fv fragment 
Intra Section relationships Canonical numbering 

Distances up to 17A reported 



Light Chain Heavy Chain 



Light 
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Table If continued 



Heavy Heavy Chain 
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TABLE 2 

DXJPI-EX OI.IGONUCI.EOTIDES NUMBER OF BASES IN 

TOP STRAND BOTTOM STRAND 



A 


SEQ 


ID 


No 


1 


+ . 


SEQ 


ID 


No 


20 


25 




34 


B 


SEQ 


ID 


No 


2 


+ 


SEQ 


ID 


No 


19 


62 




63 


C 


SEQ 


ID 


No 


3 


+ 


SEQ 


ID 


No 


18 


59 




58 


D 


SEQ 


ID 


No 


4 


+ 


SEQ 


ID 


No 


17 


60 




61 


E 


SEQ 


ID 


No 


5 


+ 


SEQ 


ID 


No 


16 
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SEQ 


ID 


No 


€ 




SEQ 


ID 


No 


15 


62 




53 


G 


SEQ 


ID 


No 


7 




SEQ 


ID 


No 


14 


61 




60 


H 


SEQ 


ID 


No 


8 




SEQ 


ID 


No 


13 


51 




62 


I 


SEQ 


ID 


No 


9. 




SEQ 


ID 


No 


12 


60 




59 


J 


SEQ 


ID 


Nolo 




SEQ 


ID 


No 


11 


52 




51 


I 


A + 


B + C 


+ 


D 








206 




216 




II 


E > 


F + G 












184 




173 




III 


H + 


I - 


f J 












163 




172 





V 
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INDICATTONS REUVTTNG TO A DEPOSITBD MICIROOROANISM 

(PCTRuIcl36tr) 



A. The indicrtions made below relate to the mieroorgankm Tfeferw^ri ir^ m t h> *Uy^ ptjpn 



pICI 10T9 



B. IDENTIFICATION OF DEPOSIT 



Fimfaer deposits are idoitiGed on an additional sheet 



Name of d^xxsitary institution 

gATIOHAL COLLECnOH OF INDUSTRIA L AND MARINE BACTERIA 
Addfcss of dtqmitaty institution (Zi^^ 

23 St Machar Drive 
Aberdeen AB2 IRY 
Scotland 
UK 



1 Date of deposit 


Accession Number 




19 Feb 91 




U037O 1 



C ADDmONAL INDICATTONS Hiis inlbnnatian b continued on an additional sheet Q 



-in respea of aU designated states In %v«cA such aciioo is possible and to 
the extern that it is legaMypermissibletindof the lawof the desigoatedstae 
rt ts requested that a sampTe of the depended inicn>^)igar^s) be made* 
avaOablo only by the issue thefeof to an Independent expert in accocdance 
%wth the relevant patent legtsfation. ERC Hbte 28(4), UJC Rute 17ra) 
Austrafian Regulation 3^(3) and geneially similar provisions awmrs 
muiandts for any other designated state*. 



p. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE <fflh> i«£a,io«c^^forcaiesig«cicdStaus} 



E. SEPARATE FURmSHmO OF INDICATIONS fWA/b«fc//««iWfia^^ 



icttUonse.^^ ^Accession 



0^ 



For receiving OfOce use only 



This sheet wasyteceived with the international application 



Authorized officer 



Fonn PCr/RO/I34 (July 1992) 



For Intemationai Bureau use only 



This sheet was received by the Internationa! Bureau 

0 2 OECEMBRE 1993 



on: 



Authorized oflicer 
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INDICATIONS REXATING TO A DEPOSITED MICROORGANISM 

(PCTRuIcl36£r) 



I A. The indications nude below relate to the microo rga nism referred to in the description 
on page .line 1^ . 



ICt Zbfa 



S. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet | [ 



Name of depositary instttotion 

UATIOKAL COLLECTION OF INDUSTRIAL AND l^^ARINE BACTERIA 
Address of depositary instittttion (UtchtHmg posul <odc mndecunuy) 



23 St Machax Drive 
Aberdeen AB2 IRY 
Scotland 
UK 



1 Date of deposit 




Accession Number 






11 SEPT 93 




U0589 



C, ADDITIONAL INDICATIONS (ica^hhuJ^iffuH appGcablc) Ibis information b continued on an addiUonal sheet 



"In respect of an des^nated states in vrfacii such aalon is possible and to 
the extent that it is legally pernusstbia tmderthe law of the des^nated state, 
it is requested that a sample ot me deposited nucro-of9antsm(s) be made 
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CIAIMS 

1. A ligand binding variable domain (V-min) comprising: 

A) a framework region comprising: 

(a) a cyclically permuted central fi-barrel; 

(b) outer beta- sheet segments and; 

(c) linker sections; and 

B) complementarity determining regions (CDRs) ; 

whereby said central S-barrel. optionally in combination with said 
outer fi- sheet segments, orients said CDRs in space for binding ligand; 
the linker sections provide linking between A) & B) to form the V-min, 
in which the framework region ijer se has upto cibout 180 amino acids, 

2 . A V-min according to claim 1 wherein the central beta barrel 
has intact strands, 

3. A ligand binding variable domain (V-min) comprising: 

(a) a cyclically permuted central IS-barrel having intact strands; 

(b) outer beta-sheet segments; 

(c) complementarity determining regions (CDRs) ; amd 

(d) linker sections for linking integers (a) , (b) & (c) into a v-min, 
whereby said central S-barrel, optionally in combination with said 
outer fi-sheet segments, orients said CDRs in space for binding ligeuid. 

4. A V-min according to claim 3 wherein framework integers (a), 
(b) and (d) per se have upto about 180 amino acids. 

5- A V-min according to any one of claims 1-4 wherein the CDRs 

substantially reproduce the spatial orientation of CDRs from a parent 
antibody cuid substantially retain the ligand binding specificity and 
affinity of the parent antibody. 

6. . A V-min according to any one of claims 1-5 wherein the V-min 

is hvimanised. 
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7. A V-min according to any one of claims X-6 wherein the 
V-mJLn is in the form of one continuous polypeptide. 

8. A V-min according to any one of claims 1-7 comprising the 
following, canonically numbered, core segments; light chain residues 
21-55; light chain residues 84-104; heavy chain residues 21-60 and; 
heavy chain residues 88-109. 

9. A multivalent ligand binding moiety comprising at least one 
V-rain as defined in any one of claims 1-8. 

10. An conjugate comprising a V-min as defined in any one of 
claims 1-8 and an effector molecxile. 

11- A polynucleotide ^sequence encoding a V-min as defined in any 

one of claims 1-8. 

12 . An e^qpression vector comprising a polynucleotide sequence as 
defined in claim 11. 

13 . A host cell comprising an expression vector as defined in 
claim 12. 

* 

14. A method of producing a V-min comprising culture of a host 
cell as defined in claim 13. 

15. An immxmotoxin pharmaceutical composition comprising a v-min 
as defined in any one of claims 1-8 in association with a 
pharmaceutical excipient or carrier. 

16 . A radioimaging composition conqprising a V-min as defined in 
any one of claims 1-8 in association with a pharmaceutical excipient 
or carrier. 
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17. An antibody-enzyme phcunnaceucical composition for ADEPT 
comprising a V-min as defined in any one of claims 1-8. 

18. A method of treatment of a tumour con^rising the 
ad mini stration to a human or animal in need of such treatment of a 
pharmaceutically effective amount of a pharmaceutical composition 
defined in any one of claims 15 or 17, 

19. Use of a radioimaging composition defined in claim 16 to 
produce a radioimage as an aid to diagnosis of a condition in a human 
or animal. 
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Ploure 1 Beta Strand ArrranQement: T>^|poal6bulin Variable Regions 
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Structiirally important residues Cysteines (Ii23,Ij88; H22,H92) indicated 



by c also Tryptophans Ii35 and H36 are represented by W. 
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Figure 2 The qg^Q 9^m^^%§ 
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The core segmentis are emboldened 
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Figure 3 Beta Strrand Aarranaeii K^g^t Immunoal obul in Variable Regions 
The V-mln Construct: 4m 



LIGHT CHAIN 



L3 



97 



I 

H64##l - -I- 4 



I « 

/ : 

I 

I 6 

I \ # 
9 I # 



LI 

90 ♦ 33 
* 

25 

: 
: 
t 

t , 
t 
t 
t 

21 



L2 



50 



52 



I 

104> 



19 



84 



69--I--68 53 
I 



55 



/ 

I 

\ 

163 . 
«| . 
# I t 
76 # / t 
.# 1 61- 
79- I 
. 40 41 



13 



107 



HEAVY CHAIN 

H3 HI R2 

* 95 * 33 52 56 
102 

1 1--3 j 25 1 75--I--74 



8 



12^ 



18 



109 88 

|» <87 
ft 

# 

15 



60 



64«ftLl 



\ : 
I 66 

I 



82 



41 42 



112 



I S-stremds above paper 

: B- Strands behind paper 

. vertical linkers 

- horizontal linkers 

# non- sequential links for V-rain 



The emboldened sequence is the V-min construct. 
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PlgORB 4 - ALIGNMESTT OF FAB LIGHT CHAINS FOR P19,9 AND IMCP 

1 50 
Imcpl DTVMTQSPSS LSVSAGERVT MSCKSSQSLL NSGNQKNFIiA WYQQKFGQPP 

F1991 DIKMTQSPSS MYASIiGERVT ITCK ASODI NSYL S WFQQKPQKSP 

<* S3 > < * Ll-b • 84 44 

99 

imcpl KLLIYGASTR ESGVPDRFTG SGSGTDFTLT ISSVQAEDIiA VYYCQNDHS 
F1991 KTLI YRANR L VDGVPSRFSG SGSGQDYSLT ISSLEYEDMG lYYCL OYDB 

* Lll-a >• * < 85 • 93 

147 

imcpl YPLTFGAGT KLEIKRADA APTVSIFPPS SEQLTSGGAS WCFLNNFYP 
F1991 FPRT FGGGT KLENKRADA APTVSIFPPS SEQLTSGGAS WCFLNNFYP 

LIII > 141 

195 

imcpl KDINVKWKID GSBRQNGVLN SWTDQDSKDS TYSMSSTL. . TLTKDBYERH 
.F1991 KDINAKWKID GSBRQNGVLN SWTDQDSKDS TYSMSSTLSL TLTKDEYERH 

191 

220 

imcpl NSYTCEATHK TSTSPIVKSF NRNEC 
F1991 NSYTCEATHK TSTSPIVKSF NRNEC 

216 



The MCP and F199 sequences are listed in SBQ ID NOSs 39 and 40 
respectively . 
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PIGDRE 4 COKTIHUKL) . _ 

AI.IGKMgHT OP FAB HEAVY CHAINS FOR yi9.9 AND IMCP 

1 50 
imcph EVKLVESGGG LVQPGGSLRL SCATSGFTFS DFYMEWVRQP PGKRLEWIAA 
F199h QVKLEESGGG LVQPGGSMKL SCAAS GFTFS DAWMDWVROS PEKGLEWVAE 

< Hl-a * S2 50 

100 

Imcph SRNKGNKYTT EYSASVKGRF XVSRDTSQSI LYLC^OIALRA EDTAIYYCAR 
F199h I GNKGNNH AT YYAESVKGRF TVSRDTSKSR VYL®1NSLRV EDTGTyYCTT 

Hll-d > * * < 8l •100 

143 

imcph NYYG STWYFDVWGA GTTVTVSSES ARNPTIYPLT LPP.ALSSDP 

F199h R, , . FA YWGO GTLVTVSAAK TTPPSVYPLA PGSAAQTNSM 

Hiri > 136 

193 

Imcph VIIGCLIHDY FPSGIMNVTW GKSGKDITTV NFPPAIjASGG RYIMSNQLTL 
F199h VTLGCLVKGY FP.EPVTVTW NSGSIiSSGVH TFPAVLQSD. LYTLSSSVTV 

185 

222 

imcph PAVECPEGES vkcsvqh. ds npvqeldvnc 

F199h PSSTWP.SET VTCMVAHPAS STKVDKKIVP RDC 

217 

Complementarity Determining I*oops are underlined. 

Canonical Structure Type (reference 4) Is indicated alphabetically 
Framework residues important to Loop conformations are indicated by 

Segments used in the v-min design are demarcated by < >. 

The order of appearwce in the V-min sequence is indicated by stt. 



The MCP and F199 sequences are listed as SEQ ID NOS. 41 and 42 
respectively. 
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FIGDRB 5 

(a) AMIMO ACID SBOTlKHCg FOR F19 .9 V-MIH CQgrSTOUCTS 

QVQLQSS N-Terminal modifications to facilitate expression 
QVQLH 

TGTYYCTTRPAYWGQCrniVGGSMKI.SCJU^aPTPSn^^ 

TAG sequence EQKLISEEDLN* 



(b) AKIHD ACID SKQQgNCg FOR IMCP V -Ittin 

XAIYYCAHangYGSTOa e yU V WQAGTTVGGSIiRIiSCATSgFTFSDFaiSWVR 

QFPGKRIiBWXAASRHKaBinTTEYSASVDI^^ 
JOiFXJUmQQKPGQFPiaCiLrraASTRSSOVFDRFTRDIi^^^ 

TFGAOTKIi 

EQKLiXSEEDLN* 



The F19.9 and IMCP sequences are listed as SEQ, ID NOS 43 and 44 
r e spe c t i ve ly • 
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PIQPRB 6 INITIAL NUCLEOTIDB SEQUENCE FOR n9.9 V-min 

ACTGGCACTTATTACTCTACCACCSCGCnTT 

1 ^ * + 54 

TGACCGTGAATAATGACATGCmKrGCCAAACGAATGACCCCM 
TGTYYCTTRFAYWGQGTL 



GTCGGAGGATCCATOAAACrrCrCTTGTGCrGCCrCTGGAT^^ 

55 + + + 102 

CAGCCTCCTAGGTACTTTGAGAGAACACGACGGAGACCT 
VGGSMKLSCAASGFTF 



AGTGACGCCTGGATGGACTGGGTCCGCCT^TCTCCAGAGAAGGGGC^ 

103 + -+ + + — 

TCACTGCGGACCTACCTGACCCAGGCGGTCAGAGCTCTC^ 

SDAWMDWVRQSPEKGIiE WVA 



GAAATTGGAAACAAAGGTAATAATCMX3CAACGTACTATGCTGACT 

163.^ + + +- + + 228 

CTTTAACCTTTGTTTCCATTATTAGTACGTTO 

BIGNKGNNHATYYAESVDIKMT 



CAGTCTACTATCa^CTTGCAAGGCGAGTCAGGAC^^ 

- H -♦- 

GTCAGATGATAGTGAACGTTCCGCTCAGTCCTGIAATTATCG^^^ 



QSTITCKASQDI NSY LSWFQ 
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PIGORE 6. CUJMTIMUKU 

• • • • • • 

CMJ^CCAGGGAAATCTCCTAAGACCCTGATCTACCGTC 

289 -H + H + + 

GTCTTTGGTCCCTTTAGAGGATTCTGGGACTAGATGG^ 

QKPGKSPKTLIYRANRLVDG' 

• • * • • « 
GTCCCATCAAGGTTCTCreAAGATATGGGAA^^ 

349 -H + + + + + 

CMGGTAGTTCa^AGJ^CirCTATACCC^ 

VPSRFSEDMGIYYCLQYDEF 

• • ■ 
CCTCGGACGTTCGGTGGAGGCACCAAGCTG 

409 - + + + 438 

GGAGCCTGCAAGCCACCTCCGTGGTTCGAC 
PRTFGGGTKL 
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SUBSTJTUTE SHEET 
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FIGURE 9 




IMCP Pv iMCP Vmin 



SUBSTITUTE SHEET 



wo 94/12625 



PCT/GB93/02375 




FI9.9 VMIN Glu 55 ENVIROHMEaST 



SUSSTITUTE SHEET 
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FIGURE lOA 




SUBSTITUTE SHEET 
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FIGDRE 11 




IMCP VMIK ENVIROirMENTS FOR RESIDUES 
LYS 101 AND Gin 108 



SUBSTITUTE SHEET. 
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FIGURE 12 

Fv CARTOON 



V 
Ii 




wo 94/12fi25 



PCT/GB93/02375 




Yj p-Strand Arrangement 
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YjT p-Strand Arrangement 
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FIGURE 15 

Fvj3-Sandwich 



H3 HI H2 




— ■^^—/Teavy Chain 
■ ■ " ^Lisrht Chain 



SUBSTITUTE SHEET 
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riGURE 16 



Con s e jrva fc 1 ve 
Vmin Construct 




pii2j LIOZ 

I Heavy Chain " redundant segments 

ir,±ffht Chain ■■■ reoriented segments 



r.iiQOT-niTE SHEET 
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FIGURE. 17 



Cons erva. t ± vg 
Vmin 

H3 HI H2 




Heavy Chain 

w» mm m» m ^Llffht Chsiln iiimimii w mimi hm xeoxi ented segments 



SUBSTITUTE SHEET 



Y/O 94/12625 



PCT/GB93/02375 



21/55 

FIGUEE 18 



Vmin Construct 

H3 HI H2 




H^OS 



""■■^^—H'eavy Chain — — redundant segments 

— 'hlgrht Chain u miwwwiTmm i iw x-eoriented segments 



SUBSTITUTE SHEET 
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7IGURE 19 



Minimal 

Vmin Construct 




.....I... 

HII2: 



START 



FINISH 



1 

I1IO7 



* Heavy Chain 



M M M « ^Llffbt Chain 



reoriented segrments 



SUBSTITUTE SHEET 
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JIG. 20 



COHPLETE mm SEQUENCE OF McFC603 Vmln GENE 
SHOWING OLIGONnCLEOTIDE SEQUENCES ' 



10 



30 



50 



ATCGATGAATTCCTGCAGCATATGA. 
CATGTAGCTACTTAAGGACGTCGIATACTCATGA 
70 90 



GTACTGCTATCTACTACTGTGCACGTAACT 
CGATAGATGATGACACGTGCATTGA 
110 



ACTATGGCTCCACTTGGTACTTCGACGTTTGG 
TGATACCGAGGTGAACCATGAAGCTGCAAACCCCACGA 
130 150 



GGTGCTGGTACCACTGTAGGCGGCTCCC 
CCATGGTGACATCCGCCGAGGG 
170 



TCCGTCTGTCTTGCGCAACTAGTGGCTTCAC 
AGGCAGACAGAACGCGTTGATCACCGAAGTGGAAGA 
190 210 



CTTCTCTGACTTCTACATGGAGTGGGTAC 
GACTGAAGATGTACCTCACCCATG 
230 



GCCAGCCGCCGGGTAAACGTCTCGAGTGGAT 
CGGTCGGCGGCCCATTTGCAGAGCTCACCTAGCGTCG 
250 270 



CGCAGCXAGCCGIAACAAAGGTAACAAGT 
ATCGGCATTGTTTCCATTGTTCA 
290 



ATACCACCGAATACAGCGCTTCTGTTGATATC 
TATGGTGGCTTATGTCGCGAAGACAACTATAGCAGTA 
310 330 



GTCATGACCCAGTCTATGTCTTGCAAAT 
CTGGGTCAGAXACAGAACGTTTA 
350 



CTTCTCAGTCTCTGCTGAACTCTGGTAACCAGAA 



GAAGAGXCAGAGACGACTTGAGACCATTGG 

370 390 
• • • 

AGCCTGGCCAGCCGCCGAAACTCCTGATCTACGGT 

TCGGACCGGTCGGCGGCTTTGAGGACTAGA 

«0 450 



AAACTTCCTCGCGTGGTATCAGCAAA 
TCTTTTTGAAGGAGCGCACCATAGTCGTTT 

410 

GC6TCTACGCGTGAATCTGGTGTTC 
TGCCACGCAGATGCGCACTTAGACCACAAG 

470 



CGGACarrTTCACCGAAGACCTGGCT 



GXTTACIACTGTCAAAATGATCACTCTTACCCGC 



GCCTGGCAAAGTGGCTTCTGGACCGACAAATG 



ATGACAGTTTTACXAGTGAGAATGGGCG 
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FIGURE 20 . CONT] 



490 



Ivllt'j4l 



510 



530 



TGACCTTTGGCGCCGGCACCAAACTG 
ACTCKIAAACCGCGGCCGTGGTTTGACCTTGT 
550 

AATAATACGTAAGATCTG | 
TTATTATGCATTCTAGACCTAG 



GAACAGAAACTGATCTCTGAAGAAGACCTGAACT 
CTTTGACTAGAGACTTCTTCTGGACTTGA 
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FIG. 21 PEPTIDE SEQOEKCE OF McPC603 Vnln 
(Indndlng c-ayc peptide tag> 

1 TAIYYCABNY YGSTBYFDVy GAGTTVGGSL RLSCATSGFT FSDFYMEWVR 

51 QPPGKRLEWI AASBNKGNKY TTEYSASVDI 7MTQSHSCKS SQSLLNSGNQ 

101 KNFLABYQQK PGQPPKLLIY GASTRESGVP DRFTEDLAVY YCQNDHSYPL 

151 TFGAGIKLEg KLISEEDLN* * 

c-myc tag 
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RllBSTJTUTE SHEET 
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SUBSTITUTE SHEET. 
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FIGURE 25 
EcoRI 

AATTCTCCCA AATATTCTCA AATCAGCTCT TGACAATTAA TCATCCAACT 

ACTTAACTAC TACCCAAGTT CACCTAAAAA GCCTATCCAC 

Kpnl BamHI Xbal Sail PstI SphI 
AATOCTACCC CGrcATCCTrTAGACT^ CTCCACCCATCCAACCTTAG 

Clal/AccI 
CGGGCCTAAT GACCCCGCTT iHi n A TCC AC 



SUBSTITUTE SHEET 
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FIGURE 26 




Smo Smg SolU 




SUBSTITUTE SHEET 
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' FIGURE 28 



TRANSCRIPTION TERMINATION SEQUENCE 



Sail 

5' TCGACATTATATTACTAATTAATTCGGGACCCTAGAGGTCCCCTTTTTTATTTTAA 
3 ' GTAATATAATGATTAATTAACCCCTGGGATCTCCAGGGGAAAAAATAAAATT 



S£hl Hindlll 
AAAGCATGCA 3' 
TTTCGTACGTTCGA 5 ' 



SUBSTITUTE SHEET 
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Styl / Avrll 
StuI 

PvuII 
BstXI 



5599 / EcoRI 
Muni 



Aatll 




4000 



Mlul 
Clal 
MscI 



EcoNI 



RsrII 



2000 
DrdI 



3O0O 
Gsul 

Bsc I 
Earn! 1051 



igijre 



30 
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Fig.l\ 



AaMT 




BamHI 
2000 ^^^^ 
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Fig. 32 CQ NSTRUcrnnrsi OF pirn 1^7 9 



I 

603Vmin 




loe-rin m= P.KEET. 
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FIG,33 COMPLETE DMA SEQUENCE OF A 19>9 Vmin GENE 
SHO¥ING OLIGONnCLEOnDE SEQUENCES 



10 



30 



ATCGATGAATTCCTGCAGCATATGA 



50 



GTACTGGTACTTACTACTGCACTACTCGCT 



CCATGTAGCTACTTAAGGACGTCGTATACTCATGA 



70 



90 



CCATGAATGATGACGTGATGAGCGA 



110 



TTGCATATTGGGGCCAAGGTACTCTGGTTG 



GTGGTTCCATGAAACTCTCTTGCGCTGCCT 



AACGTATAACCGCGGTTCCATGAGACCAACCACCA 



130 



150 



AGGTACTTTGAGAGAACGCGACGGA 



170 



CAGGCTTCACTTTTAGCGACGCCTGGATG 



GACTGGGTCCGTCAGICTCCGGAGAAAGGTC 



I 



GTCCGAAGTGAAAATCGCT6CGGACCTACCTGACC I CAGGCAGTCAGAGGCCTCTTTCCAG 



190 210 
TGGAATGGGTGGCCGAAATTGGCAACA 



230 



AAGGTAACAACCACGCAACCTACTATGCTGAGT 



ACCTTACCCACCGGCTTTAACC G TT G TTTCC 



250 

CTGTTGATATCAA 



270 



ATTGTTGGTGCGTTGGATGATACGACTCA 



290 



AATGACCCAGTCTACTATCACGTGTAAGGCTAGCCAGGACATCAAC^ 



GACAACTATAGTTTTACTG I GGTCAGATGATAGTGCACATTCCGATCGGTCCTGTAGTTGT 



310 330 350 

CCGTG 

CGAT I GGA(nx:GACCMGGTCGTCTTTGGGCCCTTT&GAiMXrrT^^ 



GCTACCTGAGCTGGTTCCAGCAGAAACCCGGGAAATCTCCGAAAACA 



370 390 410 

• . . . 

CAAACCGTCTGGTTGATGGTGXACCCTCGAGtirTCTCTGAAGATATGGGT 



ACTACT 



GTTTGGCAGACCAACTACCACATGGGAGCTCCAAGAGACTTCTATACCCAIAAATGATGA I 



I 
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FIGURE 33 CQMTINDED 



430 



450 



470 



GTCTACAGTATGACGAGTTCCCGCGTACCTTCGGTGGCGGCACCMGCTT 



GAACAGAAAC 



CAGATGTCATACTGCTCAAGGGCGCATGGAAGCCACCGCCGTGGTTCGAACT^ 

490 510 

• • • • 

TGATCTCTGAAGAAGACCTGAACEAATAATACGTAAGATCTG 

ACTAGAGACTTCTTCTGGACTTGATTATTATGCAOT 



CTTTG 
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riG,34 PEPTIDE SEQUENCE OF 19.9 Vmin 
(inclttdlng c-nryc peptide tag) 

1 TCTYYCTTRT AYVGQGTLVG GSMKLSCAAS GFTFSDAyMD yVRQSPEKGL 

51 EWVAEIGNKG NNHATYYAES VDHaiTQSTI TCKASQDINS YLSOTQQKPG 

101 KSPKTLIYRA NRLVDGVPSR FSEDMGIYYC LQYDEFPRTF GGGTKLEQKL 

151 ISEEDLN* * 
c-i^c tag 
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Flqmre 35 

1 ATG AAA TAG CTA TTG CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT 48 
Met Lys Tyr Leu I*eu Pro Tlir Ala Ala Ala Gly Leu Leu Leu Leu Ala 

49 GCC CAA CCA GCG ATG GCC CAG GTG CAG CTG CAG GAG TCA GGA CCT GGC 96 
Ala Gin Pro Ala Met Ala Gin Val Gin Leu Gin Glu Ser Gly Pro Gly 

97 CTG GTG GCG CCC TCA CAG AGC CTG TCC ATC ACA TGC ACC GTC TCA GGG 144 
Leu Val Ala Pro Ser Gin Ser Leu Ser He Thr Cys Thr Val Ser Gly 

145 TTC TCA TTA ACC GGC TAT GGT GTA AAC TGG GTT CGC CAG CCT CCA GGA 192 
Phe Ser Leu Thr Gly Tyr Gly Val Asn Trp val Arg Gin Pro Pro Gly 

193 AAG GGT CTG GAG TGG CTG GGA ATG ATT TGG GGT GAT GGA AAC ACA GAC 240 
Lys Gly Leu Glu Trp Leu Gly Met lie Trp Gly Asp Gly Asn Thr Asp 

241 TAT AAT TCA GCT CTC AAA TCC A6A CTG AGC ATC AGC AAG GAC AAC TCC 288 
Tyr Asn Ser Ala Leu Lys Ser Arg Leu Ser He Ser Lys Asp Asn Ser 

289 AAG. AGC CAA GTT TTC TTA AAA ATG AAC AGT CTG CAC ACT GAT GAC ACA 336 
Lys Ser Gin Val Phe Leu Lys Met Asn Ser Leu His Thr Asp Asp Thr 

337 GCC AGG TAC TAG TGT GCC AGA GAG AGA GAT TAT AGG CTT GAC TAC TGG 384 
Ala Arg Tyr Tyr Cys Ala Arg Glu Arg Asp Tyr Arg Leu Asp Tyr Trp 

385 GGC CAA GGC ACC ACG GTC ACC GTC TCC TCA TAA TAA GAG CTA TCC CGG 432 
Gly Gin Gly Thr Thr Val Thr Val Ser Ser 

433 GAG CTT GCA TGC AAA TTC TAT TTC AAG GAG ACA GTC ATA ATG AAA TAC 480 

Met Lys Tyr 

481 CTA TTG CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA 528 
Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro 

529 GCG ATG GCC GAC ATC GAG CTC ACC CAG TCT CCA GCC TCC CTT TCT GCG 576 
Ala Met Ala Asp He Glu Leu Thr Gin Ser Pro Ala Ser Leu Ser Ala 

577 TCT GTG GGA 6AA ACT GTC ACC ATC ACA TGT CGA GCA AGT GGG AAT ATT 624 
Ser Val Gly Glu Thr Val Thr He Thr Cys Arg Ala Ser Gly Asn He 

625 CAC AAT TAT TTA GCA TGG TAT CAG CAG AAA CAG GGA AAA TCT CCT CAG 672 
His Asn Tyr Leu Ala Trp Tyr Gin Gin Lys Gin Gly Lys Ser Pro Gin 

673 CTC CTG GTC TAT TAT ACA ACA ACC TTA GCA GAT GGT GTG CCA TCA AGG 720 
Leu Leu Val Tyr Tyr Thr Thr Thr Leu Ala Asp Gly Val Pro Ser Arg 

721 TTC AGT GGC AGT GGA TCA GGA ACA CAA TAT TCT CTC AAG ATC AAC AGC 768 
Phe Ser Gly Ser Gly Ser Gly Thr Gin Tyr Ser Leu Lys He Asn Ser 

769 CTG CAA CCT 6AA GAT TTT GGG AGT TAT TAC TGT CAA CAT TTT TGG AGT 816 
Leu Gin Pro Glu Asp Phe Gly Ser Tyr Tyr Cys Gin His Phe Trp Ser 

817 ACT CCT CGG ACG TTC GGT GGA GGG ACC AAG CTC GAG ATC AAA CGG GAA 864 
Thr Pro Arg Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lys Arg Glu 
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Figure 35 (continued) 

: 865 CAA AAA CTC ATC TCA' GAA GAG GAT CTG AAT TAA TAA TGA TCA AAC GGT 912 
Gin Lys Leu lie Ser Glu Glu Asp lieu Asn 

913 AAT AAG GAT CCA GCT CGA ATT CAC TGG CCG TCG TTT TA 950 
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Ficnire 40 Dl . 3 V-min conservative 



MKYLLPTAAAGLLLLAAQPAMA pel B secretion leader 

SRLS ISKDNSKSQVFLKMNSLHTDDTARYYCARiERDY 
h65 0 . 0. . , 0 0. - 

RLDYWGQGTTLTVS APSQSLSITCTVSGFSLTGYQV 
. . - . - 0. . .S. , .hll5 hl3 0 . 0 

NWVRQPPGKGLEWLGMIWGDGNTDYNSALK DIQMTQ 
0 0. . . ; 0 A.h64 11 

SPASLSASVGETVTITCRASGNIHNYLAWYQQKQGKS 
0 , . 0 0 0 

PQLLVYYTTTLADGVPSRFSGSGSGTQYSLKINSLQP 
0 0 0 0 

EDFGSYYCQHFWSTPRTFGGGTKLEIKR 
• 0 ; 0 , 1108 

EQKL.ISEEDLN tag sequence 
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Plotire 41 

a) h65-115/hl3-64/ll-l08 Intact Light Chain 

MetLysTyrLeuLeuProThrAlaAlaAlaGlyLeuLeiiLeuLeiiAlaAlaGlnProAlaMet^ . , . , 

pel B secretion leader 
AspTyrProGlySerSerGly Secretion facilitator 

S erArgLeiiSer I leSerLysAspAsnSerLysSerGlnVal PheLeuLysMet AanS erLexiHi sThrAspAspThr 
h65 70 80 90 

AlaArgTyrTyrCysAlaArgGlxiArgAspTyrArgLeuAspTyrTr^ . . 

' • — - 100 110 hllS 

AlaProSerGliiSerLeuSerlleThrGysThrValSeraiyPheSerLeuThrGlyTyrGlyValAsnTrpValArg 
hl3 20 30 

GlnProProGlyLysGlylieuGluTrpLeuGlyMetlleTrpGlyAspGlyAsnThrAspTyrAsnSerAlaLeuLys 
. . . -40 50 60 h64 

AspIleGlnMetThrGlnSerProAlaSerLeuSerAlaSerValGlyGluThrValThrlleThrCysArgAlaSer 
11. . .10 20 

GlyAsnlleHisAsnTyrljeuAlaTrpTyrGlnGlnLysGinGlyLyBSerProGliiIieiiLeuValT^ 
. • . 30 40 50 

ThrLeuAlaAspGlyValProSerArgPheSerGlySerQlySerQlyThrQlnTyrSerLeuLysIleAsnSe 
60 70 

GlnProGluAspPheGlySerTyrTyrCysGliiHisPheTrpSerThrProArgThrPheGlyGlyGlyThrLysL^ 
80 90 100 

GluIleLysArg. , . . 
1108 

GluGlnLysIieuXleSerGluGliiAspLeuAsn tag sequence 
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b) 161-107/113-60/hl-ii6 Intacc Heavy Chain Dl . 3 

MetLysTyrLeuLeuProThrAlaAlaAlaGlyLeiiLeiiLeiiLeiiAlaAlaGlnProAlaMeti^ .... 

pel B siecretion leader 
GlySerSerSerGly Secretion facilitator 

ArgPheSerGlySerGlySerGlyThrGlnTyrSerLeuLysIleAsnSerLeuGlnProGliiAspPheGlySe^^ 
161 70 80 

TyrCysGlnHisPheTrpSerThrProT^rgThrPheGlyGlyGlyThrLysLeuGluIleLy .... 
90 , 100 1X07 

AlaSerValGlyGluThrValThrlleThrCysArgAlaSerGlyAsnlleHisAsnTyrLeuAlaTrpT^ 

113 20 30 

LysGlnGlyLysSerProGliiIieiiIieuValTyrTyrThrTlirThrLeuAJ.aAspGlyValProS . . . 
40 . , 50 160 

GlnValGlnlieuIiysGluSerGlyProGlyLeuValAlaProSerGlnSerl/eiiSerlleThrCysTlirValSerGly 
ill - • • • • 10 20 

PheSerLeuThrGlyTyrGlyValAsnTrpValArgGlnProProGlyI*ysGlyLeuGluTrpI.euGly^ 
30 40 50. . i . . . 

GlyAspGlyAsnThrAspTyrAsnSerAlalieuLysSerArglieiiSerlleSerLysAspAsnSerLysSerGlnVal 

• • -60 70 

PheLeuLysMetAsnSerleuHisThrAspAspThrAlaArgTyrTyrCysAlaArgGltiA^ 

• V- -BO 90 100 

TyrTrpGlyGlnQlyThrThrLeuThrValSerSer 
• 110 hll6 



GluGlnLysLeuIleSerGluGliiAspLeuAsn tag sequence 
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Figure 43 

c) h87-llS/hl3-64/ll-7/121-6l/176-108 Minimal Construct D1.3 

MetLysTyrLeuIieuProThrAlaAlaAlaGlyLeuLeiiLeuLeuAlaAlaGlnProA^ . . . 

pel B secretion leader 



ThrAspAspThrAlaArgTyrTyrCysAlaArgGluArgAspTyr 

>87 90 iOO 110 

ThrValSer,,. 
.... .hllS 

AlaProSerGlnSerLeuSerlleThrCysThrValSerGlyPheSerlieuThrGlyTyrGlyVal^ 

lil3. . . 20 30 

GlnProProGlyLysGlyLeuGluTrpLeuGl^etlleTrpGlyAspGlyAsnThrTlspTyrAsnSerAlaLeuLys 
■ • 50 60 h64 

AspIleGlnMetThrGlnSer. . . IleThrCysArgAlaSerGlyAsnlleHisAsnTyrLeuAlaTrpTyrGlnGln 
11 7. .,121 30 

LysGlnGlyLysSerProGlnLeiiLeuVallYrTyrThrThrThrLeuAlaAspGlyValProSer . . • 
40 • • 50 161 

AsnSerLeuGlnProGlxiAspPheQlySerTyrTyrcysQlnHisPheTrpSerThrProAr^^ 

^■^e 80 90 100... 

ThrLysLeuGluIleLysArg 
1108 



GiuGlnLysLeuIleSerGluGluAspLeuAsn tag sequence 
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Fiarure 44 

a) h67-125/hl3-66/ll-112 Intact Light Chain KOL 

MetLysTyrLeuLeuProThrAlaAlaAlaGlyl^uLeuLeiiLeuAlaAlaGlnProAlaMetA^ . . . 

pel B secretion leader 
GlySerSerSerGly Secretion facilitator 

ArgPheThrlleSerArgAsiiAspSerLysAsnThrLeuPheLeuGlnMetAspSerLeuArgProGliiAspTh^ 
h67 .70 80 90 

valTyrPheCYsAla^UrgAspGlyGlyHisGlyPheCysSerSerAlaSerCysPheGlyProAspTyrTrpGlyG 
100 110 

GlyThrProValThrValSer . . . GlnProGlyArgSerLeuArglieuSerCysSerSerSerGlyPhellePfaeSer 
. . .120 hl25 hl3 20 .30 

SerTyrAlaMetTyrTrpValArgGlnAlaProGlyLysGlyLeuQluTrpValAlallelleTrpAspAspGlySer 
- 40 50 

AspGlnHisTyrAlaAspSerValLysGly. . .GlnSerValLeuThrGlnProProSerAlaSerGlyThrProGly 
60 h66 11 10 

GlnArgValThrlleSerCysSerGlyThrSerSeriVsnlleGlySerSerThrValAsnTrpTyrGlnGlnleuPr 
• • . • 20 30 40. . . 

GlyMetAlaProLysLeiiLeuIleTyrArgAspAlaMetArgProSerGlyValProAspArgPheSerGlySerLys 
50 60 

SerGlyAlaSerAlaSerLeuAlalleGlyGlyLeuGlnSerGluAspGluThrAspTyrTyrCYsAlaAlaT 

. 70 80 90 

ValSerLeuAsnAlaTyrValPheGlyThrGlyThrLysValThrVallieuGlyGln 
■ • . 100 110. .1112 



GluGlnLysIjeuIleSerGluGluAspLexiAsn tag sequence 
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Figure 45 

b) 162-ld9/112-61/hl-125 Intact Heavy Chain KOL 

MetLysTyrLeiiLexiProThrAlaAlaAlaGlyLeuLeiilieuLexiAlaAlaGlnProAlaMetA^ 

pel B secretion leader 

GlySerSerSerGly Secretion facilitator 

ArgPheSerQlySerLysSerGlyAlaSerAlaSerLeuAlalleGlyGlyLeuGlnSerGliaAspGluThrAspTyr 
162 , 70 80 

TyrCysAlaAlaTrpAspValSerLeuAsnAlaTyrValPheGlyThrGlyThrLysVal GlyThrProGlyGln 

.... . . .90 100 1109 112 

ArgValThrlleSerCysSerGlyThrSerSerAsnlleGlySerSerTlirValAsnTrpTyrGlnGlnLeuProGly 
... ... 20 30 40 

MetAlaProLysLeiilieuIleTyrArgAspAlaMetArgProSerGlyValProAsp GluValGlnLeuValGln 

50 161 hX 

SerGlyGlyGlyValValGlnProGlyArgSerLexiArgLeuSerO/sSerSerSerGlyPliellePheSerSerTyr 
. 10. 20 30 

AlaMetTyrTrpValArgGlnAlaProGlyLysGlyljeuGluTrpValAlallelleTrpAspAspGlySerAspGln 

. 40 50 

Hi sTyr AlciAspSerValLysGlyArgPheThr I leSerArgAsnAspSerliysAsnThrl^exiPheLeuGlnMe tAsp 

. . . ,60 70 80 

SerLeuArgProGlxjAspThrGlyValTyrPheCysAlaArgAspGlyGlyHisGlyPheCysSerSerAlaSer^ 
90 100 - 110 

PheGlyProAspTyrTrpGlyGlnGlyThrProValTlirValSer 
120 hl25 

GluGlnLysLeuIleSerGluGluAspLeviAsn tag sequence 
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Figure 46 

a) h66-118/hl3-65/ll-112 Intact Light Chain PAC-1 

MetLysTyrLeulieuProThrAlaAlaAlaGlyLeuLeuLexiLexiAaaAlaGlnProTaa^ 

pel B secretion leader 

GlySerSerSerGly Secretion facilitator 

ArgLeuSerlleSerLysAspAsnSerLysSerGlnValPhePheLysMetAsnSerLeuGlnAlaAsnAspThrAla 
70 80 90. 

IleTyrTyrCysAlaArgArgSerProSerTyrTyrArgTyrAspGlyAlaGlyProT^ 

• 100 110 

Gly . . .GlnProSerQlnSerlietiSerlleXhrCysThrValSerGlyPheSerLeuThrSerTyrGlyValHisTrp 
hlia hl3 20 . , . .30 

ValArgGlnSerProGlyLysGlyLeuQluTrpLeuGlyVallleTrpSerGlyGlySerThrAspTyrAsnAlaAla 
..........40 .50 60 

PhelleSer. . . AspValLeviMetThrGlnGlnThrProLeuSerLeuProValSerlieuGlyAspGlnAlaSerlle 
......h65 11 10 20. _ 

SercysArgSerSerGlnSerlleValHisSerAsnGlyAsnThrTyrlieuGluTrpTyrLeuGlnLysProGlyGl^ 
30 ; . . 40 

SerProLysLeiiLeuIleTyrLysValSerAsnArgPheSerGlyValProAspArgPheSerGlySerGlySerGly 
50 60 70 

ThrAspPheThrLeuLysIleSerArgValGltiAlaGluAspLeuGlyValTyrTyrCysPheGlnGlySe 

• • • • 80 90 

ProTyrThrPheGlyGlyGlyThrLysLeuGluIleLys 
100.- 110. .1112 

GluGlnLysLeuIleSerGluGluAspLeuAsn tag sequence 
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Plmire 47 

b) 166-lli/li2-65/hl-123 Intact Heavy Chain PAC-1 

MetLysTyrLeuLeuProThrAlaAlciAlaGiyLeiiLeuLeuLeiiAlaAlaGlnProA^ . . . 

. pel B secretion leader 

GlySerSerSerGly Secretion facilitator 

ArgPheSerGlySerGlySerOlyTlirAspPheThrLexiLysIleSerArHValGliiAlaGliiAspLeu^ 

166. 70 80 90. . . 

TyrCysPheGlnGlySerHisValProTyrThrPheGlyGlyGlyT:lirLysLeuGluIl . . ProValSerLeuGly 
100 . . .1111 112 

AspGlnAlaSerlleSerCysArgSerSerGlnSerlleValHisSerAsnGlyAsnThrTyrLeuGluTrpTyrLeu 
20 30 40 

GlnLysProGlyGlnSerProLysIieuljeuIleTyrLysValSerAsnArgPheSerGlyValProAsp . . .GlnVal 
50 .60 165 hi* . • . 

Glz^eiLLysGlnSerGlyProGlyLeuValGlnPr oS erGlnS erLeuS er X 1 eThr CysThrVal S erGlyPheS er 
10 20 

IieuThrSerTyrGlyValHisTrpValArgGlnSerProGlyliysGlyLeuGluTrpIjeuGlyVallleTrpSerGly 
. . . .36 40 .50 

GlySerThrAspTyrAsnAlaAlaPhelleSerArgLetiSerlleSerLysAspAsnSerLysSerGlnValPhePhe 
60 70. 80 

LysMetAsnSerljeuGlnAlaAsnAspThrAlalleTyrTyrCysAlaArgArgSerProSerTyrT^ 
90 100 

GlyAlaGlyProTyrTyrAlaMetAspTyrTrpGlyGlnGlyThrSerVal 
.... .110 120 hl23 

GluGlnLysLeuXleSerGluQluAspLeuAsn tag sequence 
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MOLECULAR DEVICES 
Raw Data (Hate) 



DATA FILE 
DESCRIPTION 
PROTOCOL 
DESCRIPTION 
MODE 
WAVELENGTH 
MEAN TEMP 



Vcon assay 3.9.93 
VCON ASSAY 3.9.93 



Endpoint 
490 

22.50"C 



AUTOMIX: ON 



PRINTED: 10/11/93 



CAUBRATION: ON 
SET TEMP: OFF 



Optical Density 





1 


2 


3 


4 


5 


6 


7 


8 


9 


1 0 


1 1 


12 


A 


3.856 


3.892 


3.706 


3.237 


2.697 


1.904 


1.214 


0.650 


0.417 


0.293 


0.255 


0.215 


B 


3.946 


3.865 


3.724 


3.300 


2.716 


1.981 


1.196 


0.701 


0.408 


0.298 


0.242 


0.220 


C 


0.195 


0.180 


3.940 


0,165 


0.165 


0.168 


0.160 


0.163 


0.163 


0.172 


0,174 


0.168 


D 


0.168 


0.176 


0.180 


0.174 


0.165 


0.177 


0.164 


0.166 


0.174 


0-172 


0.189 


0.175 


E 


0.172 


0.167 


0.167 


0.161 


0.165 


0.168 


0.159 


0.161 


0.167 


0.195 


0.177 


0.164 


F 


0.199 


0.178 


3.978 


0.167 


0.175 


0.540 


0.172 


0.180 


0.176 


0.173 


0,171 


0.170 


G 


0,173 


0.181 


0.174 


0.398 


0.167 


0.178 


0.174 


0.180 


0.171 


0.174 


0.163 


0.171 


H 


0.171 


0.180 


0.171 


0.174 


0.176 


0.174 


0.180 


0.185 


0.180 


0.201 


0.180 


0.158 
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